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Browse in the EOS Era 


I. Abstract 

The Information Sciences Research Group, University of California, Santa Barbara 
has undertaken a program of research and development over the past two years to examine 
the problem of science data browse. Given the tremendous data volumes that are planned 
for future space missions, particularly the Earth Observing System in the late 1990’s, we as 
scientists must understand our needs for access to large spatial databases. 

During this second contract year, we continued to work with a multi-disciplinary 
group of colleagues, to refine our concept of data browse. Further, we have developed 
software to provide a testbed of our concepts, both to locate possibly interesting data, as 
well as view a small portion of the data. We have placed Build II of this software testbed 
on a minicomputer and a PC in our laboratory, and provided accounts for our colleagues to 
use the testbed. A number of colleagues have begun to consider our testbed software as an 
element of their in-house data management plans, and we are continuing to work with and 
advise them. 


Information Sciences Research Group 
University of California, Santa Barbara 


Page 2 



Browse in the EOS Era 


II. Narrative 

Our progress reports have detailed the background and justification for this effort. 
Briefly, space programs planned for the next two decades will be able to provide 
extraordinary volumes of digital data. The Earth Observing System program, planned for 
the late 1990’s, is being designed to be able to produce 1 to 2 terrabits of data per day. 

O ur work in the last contract year is designed to help us focus on the logical interface 
between the working scientist and the tremendous data volumes coming from these 
kinds of programs, which will be stored in geographically distributed repositories. We 
explicitly assume that network capabilities will be in place to support the kinds of data 
transfer that a graphics-based browse capability will require. 

Our working hypothesis has been that there are user-friendly mechanisms which 
may help us to be able to examine reduced-volume representations of elements of a 
dataset (i.e., a browse function), which will permit us to determine the value or 
relevance of this data for further analysis. Two separate functions are required, in a 
broad sense. First, we must be able to locate collections of potentially interesting data. 
This kind of function is recognized in the NASA Ocean Data System, as well as at 
NISSDC and the US Geological Survey (as evidenced in their Earth Science Data 
Directory). Further, we believe that mechanisms must be in place to be able to examine 
some portion of a data granule (as defined by the ESADS workshop in early 1987), to be 
able to make a reliable decision on the utility of the data for a science problem. A good 
example of the latter function is the microfiche browse images which have been 
produced during the Landsat program. 

The definition of browse that was developed at the Earth Sciences and 
Applications Data Systems Workshop in February 1987 has guided our work. The 
definition is: 

A browse system enables a scientist to interactively subset a large number of 
granules or a portion of single granule for analysis. The intention of browse is not 
to duplicate all general software for graphics display of data, or the manipulation 
of images, but to provide the user with a subset of browse information to give a 
"feel" for the data and its coverage. Different disciplines and different data 
sources will probably require different functions to provide the relevant information 
to the users. The browse process must be "fast". For example, screen display 
should be presented in 30 seconds or less so that a significant number of 
granules can be examined in a single session. Since some circumstances require 
that the data upon which the browse is based be recent (i.e., quick look), long 
time delays between data acquisition and availability may not always be 
acceptable. 

The overall software architecture, database design, and hardware we are using 
are described in the papers and presentations section of this report. Overall, the 
software, written largely in Fortran 77, runs on a DEC Microvax II, with the GKS library 
providing device-independent graphics code. Users may work with simple ASCII 
terminals, VT-series terminals, or Tektronix-like monochrome graphics terminals. 
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System access is provided by 300/1200/2400/9600 BPS dial-up lines or via the SPAN 
and ARPANet networks. We provide a list of the off-campus users of the system at the 
end of this report. 

In addition to the scientists who have joined us at the BROWSE meetings we have 
sponsored, we are attempting to stay in contact with a number of other individuals. This 
include: 

Dr. Cecil Bloch 

Research Libraries Group, Jordan Quad, Stanford, CA 94350 
Dr. Harvey Croze, United Nations Environment Program 
PO Box 30552, Nairobi, Kenya 
Dr. Sarah Graves, Computer Science Department 
University of Alabama, Huntsville 
Dr. George Ludwig 

Univ. Colo., Campus Box 10, LASP, Boulder, CO 80309 
Mr. Wayne Moonyhan, United Nations Environment Program 

6, Rue del la Gabelle, CH 1227 Carouge, Geneva, Switzerland 
Mr. Jim Weber 

Public Service Satellite Consortium, Washington, D.C. 

Ms. Denise Wiltshire, USGS Information Systems Div. 

801 National Center, Reston VA 22092 
Dr. J.T. Young, Space Sciences Engineering Center 
University of Wisconsin, Madison 

We have also tried to stay in contact with a number of individuals at the various 
NASA facilities, including: 

Ames Research Center: 

Mr. James Brass, Dr. James Lawless - Ecosystems Branch 
Mr. Gary Shelton - Aircraft Program 
NASA Headquarters: 

Dr. James Dodge 
Dr. Eni Njoku 
Dr. Paul Smith 
Mr. Tony Villasenor 
Mr. Jim Weiss 
Mr. Phil Zion 

Jet Propulsion Laboratory: 

Dr. Rom Blom 

Dr. Tom Logan 

Dr. Chuck Klose (NODS) 

Dr. Victor Zlotnicky (NODS) 

Goddard Space Flight Center: 

Mr. Jim Green (NSSDC, Data Systems Lexicon) 

Dr. Forest Hall (FIFE) 
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Marshall Space Flight Center 

Mr. Michael Goodman (WETNET) 

Dr. Greg Wilson (WETNET) 

In terms of longer-term research and development during the second year, we 
emphasized several work areas. We have ported the majority of the BROWSE testbed to 
an IBM PC/AT, using Oracle as the embedded DBMS. Graphics were created by 
porting the MAPWD Fortran source to Borland’s Turbo-C. The following pages present 
the database schema used in this port, as well as sample relations. 


Information Sciences Research Group 
University of California, Santa Barbara 


Page 5 



BROWSE SCHEMA - ORACLE VERSION 



IMAGE 

BAND 

SENSOR 

SCENOD 

BANDID 

SENSORID 

DATEAQ 

SPECRES 

NAME 

CLOUDCOV 

RADRES 

SENSRTYP 

GEODESC 

SPATRES 

ORIENTATION 

DATAQUAL 

FREQ 

NUFfiWS 

ONJN 

POLAR ZTN 

SWATH 

IMAGPARM 

DEPANGLE 

DESCR 

FORMT 

SCHANNEL 



SENSOR 

BAND 

PLATFORM 

PATH 

RW 

ADDRESS 


PLATFORM 

LOCATOR 

ANCDATA 

PLTFRMID 

DAT AID 

DAT AID 

NAME 

GEODESC 

GEODESC 

MISSION 

LATDEG 

DATATYPE 

SDATE 

LATMIN 

SCALE 

EDATE 

LONGDEG 

SOURCE 

ALTITUDE 

LONGMN 

LOCATION 

ORBIT 


REFERENCE 

REF 



AGENCY 



OVERPASS 




IMGPARMS 

PARMID 

NUMJNS 

NUMSAMPS 


SNSRBND PFMSR DFQRMAT BRWSIMG 

SENSOR SENSOR FROMATID SCENQD 

BAND FLATFORM FORM PROCESSING 

MEDIUM ADDRESS 


















PLATFORM 


PLTFRMID 

NAME 

MISSI 

1 

NOAA 

6 

2 

NOAA 

7 

3 

DMSP 

5D-2 

4 

SEASAT 

1 

5 

LANDSAT 

1 

6 

LANDSAT 

2 

7 

LANDSAT 

3 

8 

LANDSAT 

4 

9 

LANDSAT 

5 

10 

SPOT 

1 

11 

HCMM 

A 

PLTFRMID 

NAME 

MISSI 

12 

SPACE SHUTTLE 


13 

AIRCRAFT 


14 

DMSP 

5D-2 

15 

NIMBUS 

7 

15 records 

selected. 



SENSOR: 


SENSORID 

NAME 

1 

AVHRR 

3 

COASTAL ZONE COLOR SCANNR 

4 

RETURN BEAM VIDICON 

5 

MULTISPECTRAL SCANNER 

6 

THEMATIC MAPPER 

7 

HIGH RESOLUTION VISIBLE 

8 

HEAT CAPACITY MAPPING MIS 

9 

SEASAT SAR 

10 

SHUTTLE IMAGING RADAR 

11 

SHUTTLE-IMAGING RADAR-B 

12 

THEMATIC MAPPER SIMULATOR 


11 records selected 


40 SIR-B/SAR 

41 SIR-B/SAR 

42 TMS/1 

43 TMS/2 

44 TMS/3 

BANDID SCHANNEL 


45 TMS/4 

46 TMS/5 

47 TMS/6 

48 TMS/7 

49 TMS/8 

50 TMS/9 

51 TMS/10 

52 TMS/11 

53 TMS/12 

53 records selected. 


PLATFORM-SENSORS : 


PLATFORM SENSOR 


1 

1 

2 

2 

15 

3 

5 

4 

6 

4 

7 

4 

5 

5 

6 

5 

7 

5 

8 

5 

9 

5 

PLATFORM 

SENSOR 

8 

6 

9 

6 

10 

7 

11 

8 

4 

9 

12 

10 

12 

11 

13 

12 


19 records selected 


SENSOR- BANDS 


SENSOR BAND 


1 

1 

1 

3 

1 

4 

1 

2 

1 

5 

1 

6 

3 

7 

3 

8 

3 

9 

3 

10 

3 

11 

SENSOR 

BAND 

3 

12 

4 

13 

4 

14 

4 

15 

4 

16 

5 

17 

5 

18 

5 

19 

5 

20 

5 

21 

5 

22 

SENSOR 

BAND 

5 

23 

5 

24 

5 

25 

6 

26 

6 

27 

6 

28 

6 

29 

6 

30 

6 

31 

6 

32 

7 

33 

SENSOR 

BAND 

7 

34 

7 

35 

7 

36 

8 

37 

8 

38 


9 39 

10 40 

11 41 


41 records selected 
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As menioned in previous reports, we prepared documentation for both the 
testbed itself, as well as the PC-based image display software created during this 
project. Copies of both of these documents appear in the appendicies of this report. 

Through our collaboratoins with the Research Libraries Group, we are 
participating in an external system design for a comparable system that will handle map 
and other non-remotely sensed data in addition to the image data now our focus. After 
this is completed, we will examine the problems of distributed heterogeneous data base 
query. We are also consider simple expert system approaches to reduce the level of 
expertise a user must have to be able to gain value from the BROWSE testbed. 

In conclusion, this report documents the work we have accomplished during the 
first contract year of NASA Grant NAGW-987, Browse in the EOS Era. We include 
copies of papers written and presentations made during the year, a status report on the 
existing testbed, and an indication of the directions we wish to pursue in the next 
contract period. We look forward to the developments to come, and our continuing 
collaboration with our science advisory team. 
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ABSTRACT 


An intelligent user interface is being added to the NASA-sponsored BROWSE 
testbed facility, developed at the University of California, Santa Barbara, Remote 
Sensing Research Unit. BROWSE is a prototype system to explore issues involved 
in locating image data in distributed archives and displaying low resolution ver- 
sions of that imagery at a local terminal. The original user interface for BROWSE 
incorporated knowledge of the data base structure and query language. Users were 
questioned on their choice of sensors and other attributes, and then BROWSE 
would formulate the appropriate query. The new feature described in this paper 
provides expert consultation to researchers not trained in remote sensing concepts. 
Users are asked about their objectives, and the interface matches the objectives 
with the database domains of image attributes. Natural vegetation applications are 
the focus of the initial development. Expertise in remote sensing of forest and 
range land is being extracted from the literature for use in the knowledge-base. 
This paper describes work in progress on development of the expert system. Shar- 
ing of expertise should encourage a broader research community to utilize remotely 
sensed data. 


INTRODUCTION 


Launch of the Earth Observing System (Eos) sensor packages as part of the Space 
Station complex in the 1990’s will magnify many of the image data management 
issues. Eos has been projected to generate an unprecedented terabyte of image 
data per day for at least a decade. Global science questions can then be meaning- 
fully addressed by researchers in many disciplines, including some who have not 
been involved with remotely sensed data before. Even for experienced remote 
sensing scientists, finding the right dataset can be a significant investment of time. 
Furthermore, the data must often be ordered on the basis of a terse, sometimes 
unreliable, database record of their quality. One approach to these shortcomings 
has been to promote a browsing capability in the data systems (NASA, 1986; 
CODMAC, 1982; CODMAC, 1986; Billingsley, 1984). 

Browsing is defined as the process of locating and viewing image data from a 
remote terminal (Star et al, 1987; Star et al, in press). A prototype browsing sys- 
tem for image data has been developed at the University of California, Santa Bar- 
bara (UCSB) under a grant from the National Aeronautics and Space Administra- 
tion (NASA). The BROWSE testbed basically integrates a database management 
system (DBMS), a user interface, communications software, and an image display 
system. The user is expected to select the appropriate attributes and their values. 

The Eos Data Panel, as well as other authors, have recommended the use of expert 
system technology to improve database access, particularly by new and infrequent 
users (NASA, 1986; Estes et al, 1986). An intelligent user interface, or front-end, 
is a program that will ‘serve as an intermediary between a database and a user who 
has little or no knowledge of the database’s architecture, language, or content’ 
(Campbell et al, in press). NASA anticipates several modes of browsing for Eos 
data, requiring different types of interfaces. The first mode, handled by the initial 
version of BROWSE, visually confirms the quality and coverage of the image 



found in the Catalog. Other users will browse for spatial objects extracted from 
highly processed data. The third form of browsing will be by researchers who may 
not be experts in remote sensing but who need inputs derived from digital imagery. 
We are attempting to support this third group with a new intelligent interface. 
Specifically, we want to evaluate two objectives: 1) can expertise in remote sensing 
capabilities be formalized in a knowledge-base, and 2) would such a sharing of 
expertise be useful to non-experts in selecting image and spatial data? In other 
words, whereas the original BROWSE system incorporated only a low level 
knowledge of the data base structure and contents, the new interface adds a higher 
level of knowledge of the spatial, spectral, and temporal parameters suitable for a 
variety of tasks in a relatively limited domain. 

Following a brief overview of the BROWSE testbed, the related concepts of 
browsing and intelligent interfaces in information science are reviewed. This 
explanation sets the foundation for discussing the intelligent user interface being 
developed for the BROWSE testbed. For prototyping, the initial application is 
remote sensing of forest and range land, where a large knowledge-base has accrued 
in the research literature. There are many potential parameters of vegetation that 
can be derived from image data, and a large number of image sources have been 
used in the past (Cameggie et al, 1983; Heller, 1985). Furthermore, many of the 
global science issues such as deforestation, desertification, climatic change, carbon 
cycling, and acid rain involve vegetative factors (Billingsley, 1984; NASA, 1984). 


THE BROWSE TESTBED 


The prototype BROWSE system was developed under the auspices of a NASA 
grant to explore formats most suitable for viewing spatial data by a range of mul- 
tidisciplinary scientists. However, we have found the display of browse images is 
irrevocably intertwined with the process of retrieving appropriate data. 

The hub of the BROWSE testbed is a relational data base management system. 
The CATALOG data base describes the attributes, or metadata, of each discrete 
image, such as its processing history, image quality, and so on. In the original pro- 
totype, a user interface was provided to guide a researcher in the query process. 
We took this approach to save the user from needing to learn the DBMS language 
and the structure of the data base. An investigator is presented a series of menus 
and prompts for selecting the query attributes. If a user finds an image that 
appears to meet their needs, based on a detailed record, they can transmit a browse 
file from the host computer to their local terminal for display. A fuller description 
of the BROWSE testbed, as well as some of the potential browse formats, are 
given in Star et al (1987 and in press). 

After our experience with the prototype system and the reaction of our scientific 
collaborators, we feel that the display process can not be divorced from the query 
process when evaluating the effectiveness of a browse facility. For instance, our 
use of a multiresolution electronic map to interactively identify a geographic area 
of interest makes the overall browse process easier. (See Figure 1 for an example 
of this graphic feature). For users who are relatively new to remote sensing, i.e., 
the wider constituency that NASA hopes to attract, an effective integration of 
locating and displaying data will be even more important. The original testbed 



provides only a low level of intelligence, or cleverness, in helping a user bridge the 
gap from what they want to what is in the data base. That is, the knowledge-base 
only concerned the DBMS language and the data base contents. The user had to 
supply their own knowledge of the appropriate type of sensor, best dates of cover- 
age, and so on. The new interface provides an additional level of remote sensing 
expertise to translate user objectives into the data base attribute domain. 



Figure 1. Example of Electronic Map for Selecting Geographic Search Area. 


BROWSING FROM THE DATA MANAGEMENT FIELD 


In the data base management field, browsing refers to the process of accessing 
information when the user is unfamiliar with the data base contents and its organi- 
zation. Furthermore, the user is not required to predefine a precise data object as 
the goal of their search (Godin et al, 1986; Robertson et al, 1985; Monarch et al, 
1987; Estes, 1986). Consequently, a browser is a free-form means of interacting 
with a database. A DBMS with a browse feature assists the user in gradually nar- 
rowing the focus of the search, suggesting attributes, related objects, asking 
appropriate questions, just as a human expert might. In other words, the browsing 
mechanism has knowledge of the application domain as well as of the data base 
structure. This section summarizes a few browsers in database systems similar to 
our testbed. A key component of these examples is the concept of views. Users 





have one or more views of a database that stems from their specialty, which deter- 
mines what they expect to see. Simultaneously, there is an architectural view of 
how the database is structured. Thus, browse facilities and intelligent interfaces are 
closely related techniques. . . 

ECO Browser is an intelligent front-end designed to manage and retrieve a loosely 
structured body of ecological observations for ecologists designing dynamic models 
(Robertson et al, 1985). Because ecology is such a diverse field (one might say 
that there are many potential user views of the data base), the browse mechanism 
was kept independent of any specific ecological context. The user is presented a 
graphic display of position in the hierarchical search space. In the initial version, 
the user was given no advice from the interface about which attributes were 
significant to the given objective. 

The RABBIT intelligent database interface (Tou et al, 1982) uses what the authors 
called retrieval by reformulation. RABBIT displays part of a single record, show- 
ing only the values for the attributes selected so far. The user then reformulates 
the query by modifying or adding to the search criteria based on this partial record. 
As with ECO Browser, the user must decide whether the data is suitable. 

The Intelligent User Interface (IUI) is being developed at the NASA Goddard 
Space Flight Center. (Campbell et al, in press). While not specifically called a 
browser, the IUI is designed to support users with little knowledge of the contents, 
structure, or query language of NASA’s scientific databases. Conceptual graphs 
define a path, employing an expert system, from the user view based on their 
domain knowledge, onto the architectural view of how the data base is organized. 


REMOTE SENSING OF FOREST AND RANGE LAND 


Analyzing vegetation cover with airborne sensors has a history of over a 100 years. 
In the 1880’s, foresters used cameras mounted on captive balloons to map forests 
in Germany. Over the last century, a substantial body of experience has accumu- 
lated. Many aspects of vegetation cover have been measured or derived by remote 
sensing techniques. Categorizing these parameters into mapping, monitoring, and 
modeling (Estes, 1986), we can include those shown in Table 1. 

Vegetation is a complex resource, or one could say, there are multiple views of 
vegetation. Forest and range land can be viewed as a commercial resource for 
timber and grazing, as habitat, fuel, watershed, recreation, esthetics, or as a link in 
nutrient cycles, not to mention its influence on climate. On the other hand, vegeta- 
tion has been observed by a wide variety of sensors, both passive and active, 
throughout the electromagnetic spectrum, at many spatial and temporal resolutions. 
Few individuals can master all the relevant disciplines. What we face then is a 
type of application that Feigenbaum and McCorduck (1983) describe as most suit- 
able for expert system development. The expertise is distributed among a rela- 
tively small number of individuals and needs to be fused in a knowledge-base to 
become more comprehensive, up-to-date, and accessible to non-experts. 
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Table 1. Vegetation Parameters Derived from Remote Sensing 

Mapping 

Stand Type 
Stand Structure 
Density or Cover 
Tree Height 
Fuels 

Wildlife Habitat 

Monitoring 

Change Detection 
Range Trend and Condition 
Deforestation/Desertification 
Stress 

Insect/Disease Attacks 
Fire Detection 

Modeling 

B iomass/Yield/V olume 
Leaf Area Index 
Productivity 
Potential Vegetation 
Evapotranspiration 


THE BROWSE KNOWLEDGE-BASE 


With such a massive body of knowledge, in terms of both the vegetation and the 
remote sensing parameters, we had to select a manageable subset of the field. We 
initially picked applications relevant to the vegetation of California, being an area 
with a very diverse mix of landscapes, and one the authors are most famili ar with. 
Papers published in the professional journals and conference proceedings were 
summarized into database records. Each article was abstracted into vegetation and 
image attributes. For example, the data extracted from three of these papers are 
displayed in Table 2. The vegetation based factors correspond to a user view of a 
database, whereas the image attributes correspond more to an architectural view of 
the BROWSE catalog. A factor for ecosystem type was included to match 
research results from similar environments when it is not available explicitly for a 
given geographical region. 

The next step was to convert this database of case studies into a knowledge-base of 
expert system rules. We have initially chosen to use an expert system shell, a 
software tool to develop knowledge-bases, rather than program our own expert sys- 
tem from scratch. The reason for this was that our objective was to attempt to 
represent the knowledge in an expert system format and not to worry about infer- 
ence and representation details at this time. 

One of the tools we are trying is VP-Expert, a commercial PC-based exert system 
shell that uses backward chaining inference to process its rule base (Paperback 
Software, no date). Backward chaining establishes a search goal, and then back- 
tracks through the rules until facts are found that support the conclusion. The goal, 










Table 2. Example Attributes Extracted from Three Research Papers 
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Author 

Franklin 

Yool et al 

Logan 

Vegetation 

Factor 

1. Type map 

2. Density 

3. Biomass 

Fuel type 

: 

Biomass 

Location 

No. California 

So. California 

No. California 

Ecosystem Type 

Temperate 
Conifer Forest 

Sclerophyllous 

Chaparral 

Temperate 
Conifer Forest 

Min.Mapping Unit 

not given 

100 acres 

not given 

Scale 

Regional 

Regional 

Ecosystem 

Platform 

Aircraft 

Landsat 

NOAA 

Sensor 

TMS 

MSS 

AVHRR 

Date 

May 84 
(incidental) 

June 78 

(minimize 

shadows) 

Sept 80 
(avoid snow) 

Time of Day 

09:30 

09:30 

15:00 

Best Spectral 
Bands 

| 

1. NTR, Thermal 

2. Red i 

3. Red 

Red, NIR 

Red 

Role of Data 

1. Type class 

2. Reflectance 

3. Reflectance 

Type class 

Reflectance 


(Franklin, 1986; Yool et al, 1985; Logan, 1983). 
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or set of goals, in our case are the appropriate data attributes of platform, sensor, 
time of year and of day, and spectral coverage. A non-expert is not expected to be 
able to answer questions about the choice of attributes direcdy. Instead, the expert 
system asks about the objectives of the study such as the vegetation factors to be 
derived, in what ecosystem type, and at what minimum mapping unit size. The 
user’s answers supply the facts needed to trigger the appropriate rule. 

A positive feature available in VP-Expert is to allow users to ask why a particular 
question is being asked of them. The default response to a why question is to 
display the rule in the knowledge-base that triggered it. A superior approach we 
will use is to provide a more technical answer. For example, if the original ques- 
tion asked by the expert system is ‘What is your minimum mapping unit size?’, the 
why response might be 'To detect objects, the spatial resolution of the sensor must 
be no more than half of the minimum mapping unit size’ . We believe this 









approach serves as a tutorial so that new users can become more knowledgeable. 

VP-Expert has a facility called INDUCE that automatically generates an initial 
knowledge-base from a table of attributes. The records of the literature summaries, 
as seen in Table 2, were put into a table. VP-Expert induces the rules from the 
table. Then we edit the results, adding answers to why questions, polishing the 
wording of the prompts, and enhancing and verifying the rules. Multiple answers 
are allowed, if, for instance, two or more sensors can both achieve the same task. 
Our approach will be to use half of the literature sources to generate the rules and 
the other half to test them. One question we will be asking is how many examples 
are needed to develop comprehensive rules. Induction is the technique often used 
in machine learning experiments (Michalski et al, 1980). 

The initial prototype is being designed as a stand-alone system on the PC. The 
user only gets the appropriate image attributes and may then log on to the 
BROWSE facility, asking for data with those attributes. Ideally, the expert system 
will be integrated into BROWSE. Then the system could directly pass the selected 
attributes to the DBMS. A further issue to be addressed at that time will be how 
to respond when no data is found. If no imagery is retrieved from the database, 
the system needs to be intelligent enough to 1) suggest alternative archives that 
might have the data, or 2) suggest queries based on similar attributes (Billingsley, 
1984). Defining similarities of ecosystem type, spectral and spatial resolution, geo- 
graphic location, and so on, will be a major challenge since it may turn out to be 
very much application specific. However, a human expert would be expected to 
cope with just this type of ambiguity. 


CONCLUSIONS 


Research experience turns out to be difficult to formalize. It is difficult to be 
comprehensive in what results to include and still be selective for genuine exper- 
tise. Thorough testing by real experts will be necessary to refine the rules. 

The initial prototype under development follows a rather rigid structure. Users are 
given a choice of responses to each question that correspond to the conditions in 
the rules. An 7 don’t know' choice is provided when appropriate, assuming that a 
user may not have a clearly defined task. However, the terms used for the choices 
may not match exactly what the user wants to do, or the terms themselves may be 
imprecise. A faculty to interpret natural language textual input of the user’s objec- 
tives would be a friendlier mode of interacting. However, natural language pro- 
cessing is a complex issue (Robinson et al, 1985) and beyond the scope of the 
current project. 

The intelligent user interface outlined above serves only one of several potential 
classes of browsers. Users that will not benefit from our initial prototype include 
those looking for specific image contents (e.g., mountain bark beetle infestations in 
lodgepole pine forest). In that case, highly processed data (level 2 or higher) is 
required and the object types stored as image attributes (NASA, 1986). In that 
situation, searching would be done on spatial objects rather than on image charac- 
teristics. Complex, or virtual objects, not stored directly could be defined from 
simpler objects (Campbell et al, in press). However, the complexity of extracting 



multidisciplinary objects from imagery acquired at the rates planned for Eos would 
be a staggering task. While an exciting prospect, browsing on image contents must 
be approached cautiously. 

Although we are still in the early stages of development of an intelligent user inter- 
face for the BROWSE testbed, we are encouraged that capturing remote sensing 
knowledge as an expert system is achievable and appropriate. The knowledge 
currently is distributed among a relatively small number of experts. Fusing 
knowledge from many experts and disseminating it as an expert system could 
encourage additional scientists to utilize remotely sensed data in their applications. 
Scientists can then focus more on their research and less on becoming database 
management and remote sensing experts (Estes, 1985). 
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ABSTRACT 

A need exists to develop superior data management and 
information systems for remote sensing and other spatial 
data sets. This issue will become even more significant in 
the near future as remote sensing instrumentation continues 
to produce ever increasing volumes of data, and scientists 
attempt global scale earth science investigations. A 
testbed information system has been created to implement 
and evaluate some proposed solutions to future data 
management issues. In particular, strategies to maximize 
user access to a wide variety of data sources, and large 
scale spatial database design are being considered. By 
implementing a prototype system valuable insight will be 
gained for spatial information system design in the future. 

INTRODUCTION 

Earth science and related technology have progressed to the 
point where the conduct of global science now appears 
feasible (Estes and Star 1986) . This is reflected by the 
growing interest of scientists from a wide range of 
disciplines in performing global scale investigations 
related to the dynamic coupling of the lithosphere, 
hydrosphere, biosphere, and atmosphere. A central element 
of such global scale studies is the utilization of 
advanced, high resolution remotely sensed data. An 
important component in this regard is the suite of sensors 
included in the Earth Observing System (EOS) planned for 
the space station complex of the 1990's (Arvidson et al 
1985) . This system will provide immense volumes (in the 
order of up to one terabyte of data per day) of high 
quality data to the scientific community. 

A critical issue which needs to be addressed in this area 
is the effective management of data derived from EOS era 
sensor systems. Included in this domain are questions 
related to both technical considerations (ie. hardware and 
software) as well as strategic considerations dealing with 
the management of the data. Also, methods of incorporating 
image data and other non- image spatial data into a single 
database environment will be required for geo-based data 
management and analysis systems. Currently, much 
sophisticated database and information management 


technology is available for such tasks. However, superior 
architectures and well considered data management 
strategies are necessary if the data volumes derived from 
EOS era sensor systems are to be effectively used for earth 
system science applications. 

The research described in this paper represents a 
preliminary attempt to clarify and resolve some of the 
issues identified above. Rapid prototyping approaches have 
been employed to create a testbed system (entitled BROWSE) 
in which user needs and data management strategies for 
large scale scientific spatial databases can be evaluated. 
The testbed consists of a data base of spatial (primarily 
image) data, and associated software which will allow the 
user to select data sets of interest, and to preview image 
data on a microcomputer workstation. The emphasis of this 
testbed is geared towards resolving data management issues 
in contrast to more technical considerations related to 
complex software implementations. 

TESTBED ENVIRONMENT 


Objectives and Goals 

In 1982, the Committee on Data Management and Computation 
(CODMAC 1982) recommended that: 

"NASA evaluate and develop the concept of 'electronic 
browse' capability, allowing users to explore data files 
via communication links. We recommend that 'quick-look' 
low resolution data be included for use in electronic 
data browsing." 

Using this type of data access technology, earth scientists 
could potentially have much faster and easier access to 
data than is currently possible. The BROWSE project is 
examining this type of technology and is attempting to 
formalize the definition of so called "electronic browse 
capability." 

There are currently several very large data centers across 
North America which are being used to archive remotely 
sensed data from both airborne and satellite mounted 
sensors (eg. EOSAT, NASA Ocean Data System, Canada Center 
for Remote Sensing, EROS Data Center, etc) . This situation 
has several problems which are either presently an issue or 
will be so in the near future. Despite a wealth of data 
and information available at each site, there exists very 
little communication and exchange of resources between 
these centers. To a large degree, this situation has 

arisen due to the use of incompatible systems (both 
hardware and software) at each site. However, current 

networking and database technology has significant 
potential to overcome much of the bottleneck in this area. 

The BROWSE project is examining implementation strategies 
which maximize user accessibility to remote sensing and 
other ancillary data sets. The research is specifically 
being conducted with a multidisciplinary orientation. In 
this regard, the system has been designed with several key 



objectives in mind. Users must be able to access the 
database from remote locations using user friendly software 
and widely available hardware. Having accessed the 
database a user should then be able to locate appropriate 
data sets, and preview subsetted images at his or her 
workstation. Implicit in these objectives are the 
requirements to define a basic set of attributes which can 
be used to query the database, and also the identification 
of a concise set of processed (eg. vegetation greenness 
indices, principle components, etc.) and subsetted imagery 
which can be employed by the user to select useful data 
sets for a specific application. 

A second major issue is the management of large data 
volumes produced by remote sensing platforms. Current 
sensor systems have already outstripped the ability of data 
management technology to cope with the large volumes of 
data which are being produced. Clearly, a need exists to 
develop data management technology and networking 
strategies which are able to handle much larger data 
volumes, and which facilitate access to this data by as 
wide an audience as possible. 

In this context, there is presently much interest in both 
developing strategies for archiving data sets, as well as 
investigating networking technology to maximize user access 
to remote sensing data sources and reduce data redundancy 
between repositories. There has been a growing awareness 
among the remote sensing community that the archiving of 
data will be an important area requiring careful 
consideration in the near future. Data management 
strategies will be central to this discussion. In this 
regard, decision strategies related to the deletion and 
retention of data sets in geographic databases will be 
fundamental . 

Concurrently, there is a distinct trend towards the 
creation of discipline specific information systems (ie. 
land and climate, atmospheric, oceanographic, etc.) for the 
purpose of data redundancy reduction and modularity of 
information content. Networking technology to maximize 
communications between these various systems will be 
essential if the data is to be utilized in an efficient 
manner. The ultimate goal of such a network system is to 
provide a distributed database of remotely sensed and other 
ancillary data, providing the applications scientist with 
maximum access to a wide variety of data sources. 

The BROWSE project is emphasizing specific components 
within this framework. In particular, database design for 
large scale image databases is being examined and optimal 
strategies for developing distributed spatial databases are 
being considered. The key objective in this regard is to 
consider requirements for information systems which 
maximize user accessibility to data sources. In this 
context, a prototype image database has been created which 
will serve as the model for one node within a distributed 
image database network. Using this test site, design 
strategies for large scale image databases are being 



implemented and evaluated. After the prototype image 
database has been populated, communications software to 
link the BROWSE system with other remote sensing 
information systems will be developed. In this manner, the 
concepts of spatial database management may be tested. 

Database Management Technology 

Database management and information system technology has 
been a field of active research for the past twenty years. 
Current database management systems (DBMS) and their 
associated data models are highly sophisticated and have 
proven to be extremely useful tools for many purposes. 
Unfortunately, these systems have been developed primarily 
for commercial applications. Consequently, they are not 
designed for handling spatial (or geo-referenced) data. A 
primary goal of BROWSE is to develop methodologies which 
effectively adapt existing DBMS technology to handle 
spatial data. 

Several data models (hierarchic, network, relational) have 
evolved in database technology from a need to efficiently 
store and retrieve data and relationships between data. 
The evolution in this regard, particularly in terms of ad 
hoc queries, is the relational model. Unfortunately, in 
many cases the complex nature of spatial relationships make 
them very difficult to represent using conventional data 
models (Peuquet 1984) . At the same time the tabular nature 
of the data representation employed in relational DBMS 
technology is ideal for storing and retrieving attribute 
data. Currently, there exists a high level of interest in 
investigating approaches to the relational model for 
spatial database applications (Lorie and Meier 1984, 
Palimaka et al 1986, Abel and Smith 1986). 

BROWSE Design Methodology 

The goals of BROWSE are to examine design approaches for 
browsing image databases, specifically for use by a multi- 
disciplinary group of users. Thus, the specific design 
attempts to incorporate as wide a range of data types as 
possible. In addition, the system is unrestricted in terms 
of geographic coverage, and is able to reference image data 
sets on a global basis. The specific user scenario for 
which this has been developed is that of the applications 
scientist accessing the database from a workstation in his 
or her office. 

The approach which has been employed in BROWSE is to create 
a testbed environment which can provide valuable experience 
towards developing a data system which allows easy, 
integrated and complete access to past, present and future 
data sets. The primary component of the testbed is a 
database consisting of remotely sensed and other geo- 
referenced ancillary data sets. This database has been 
enhanced by a set of software utilities which aid the user 
in selecting appropriate data sets of interest, and to 
preview the image data once selected. In addition, a 
database of information regarding data centers and network 
nodes is included to assist the user in locating pertinent 
data. Together, this system provides a facility for 


scientists from many disciplines at widely distributed 
locations to locate relevant data collections, browse 
available data sets, view image data, and identify existing 
ancillary data in order to facilitate a decision concerning 
acquisition of the data for the host site. 

The final component of the system is a database which 
contains information about users, their specializations 
(both scientific and geographic) and system usage. A group 
of collaborators have been targeted within the remote 
sensing user community who will be acting as system users 
to provide feedback regarding the various implementation 
strategies which have been developed. In this manner, the 
testbed system can be monitored, evaluated and subsequently 
modified with respect to the specific data types and 
functions which have been included. This input will 
provide valuable insight regarding requirements for EOS era 
data management systems. 

SYSTEM IMPLEMENTATION 
Architecture - Hardware and Software 

The BROWSE system architecture is divided into two logical 
components: hardware architecture and software 
architecture. As indicated above, the emphasis of the 
project is geared towards data management issues. However, 
to a large degree the overall success of the given 
strategies is dependent upon the system architecture. 
Therefore, technical issues must also be considered. The 
hardware architecture is composed of a host minicomputer 
(MicroVAX II) , a remote microcomputer based work station 
(IBM PC-AT) , and associated communications hardware and 
software. Using this configuration, the BROWSE system 
simulates the potential remote sensing application 
scientist who would use this system from his or her office. 
The specific hardware which has been selected for the 
project was chosen in order to be as compatible as possible 
with a diverse range of systems. Further, we have 
attempted to make the software transportable by relying on 
standard practices (eg. using standard Fortran 77) where 
possible. Thus, the end system will be accessible to a 
wide range of users. 

The host minicomputer is acting as a dedicated database 
machine. Running under the VMS operating system 
environment, the system includes a database of remotely 
sensed and other spatial data, and descriptive information 
about that data. The database is managed using a public 
domain relational DBMS. Users are able to access the 
database from remote workstations using standard telephone 
line communications protocols. In this framework, the user 
is able to access descriptive as well as quantitative 
information regarding available imagery, and to quick view 
preprocessed images if desired. 

In designing the system software for the BROWSE project, a 
rapid prototyping approach was employed. This methodology 
is a two stage process geared towards implementing a 
prototype system which will yield useful information for 



designing a final system. The first stage of this process 
involves the development of a model which defines system 
requirements and design criteria. In the second stage the 
model is implemented and system performance is evaluated. 
The ultimate goal of this type of approach is not to 
produce a final system, but rather to implement a testbed 
environment which can be used to exercise ideas and to 
provide valuable experience and guidance towards 
implementing an on-line system with minimal design 
restrictions . 

In designing the BROWSE system it was endeavored to use 
existing software wherever, possible in order to minimize 
the burden of actual software development. A public domain 
relational DBMS (RIM - Relational Information Management) 
has been used to manage the spatial database. A menu 
driven front end to this DBMS has been written in Fortran 
in order to aid the user in specifying queries without 
having to learn the RIM command syntax. Both RIM and the 
user interface software are resident on the host mini- 
computer. 

The second major software component of BROWSE is a set of 
utilities which are resident at the remote user 
workstation. These utilities include communications 
software, graphics software for defining geographic areas 
of interest, and image display software which allows the 
user to view subsetted images of selected data sets. Using 
these local workstation utilities in association with the 
database software resident at the host database node, the 
user is able to identify and preview potentially useful 
data sets for a specific application. Geographic areas of 
interest are specified interactively using graphical 
software to identify the area on a map plotted on the 
workstation screen. Available data sets for that area can 
then be selected based upon date, sensor, cloud cover, and 
other scene specific criteria. Finally, a selected image 
can be subsetted and previewed at the workstation using the 
image display software. 

Database Design 

The BROWSE database is simulating one node in a 
hypothetical distributed database of spatial data. Our 
primary focus is raster format remotely sensed data. 
Attribute information about the image data sets is included 
in the database, and queries are made based on these 
attributes. The database is composed of three major 
components: the DIRECTORY, the CATALOG, and the USER (table 
1) . The DIRECTORY consists of a top level database of 
information regarding the NODES in the distributed 
database, and a description of the holdings at each node. 
The CATALOG is a detailed database of the holdings at each 
NODE. The USER is a database of information regarding the 
various users of the system, their scientific and 
geographic specializations, as well as logging information 
which will allow the system designers to assess system 
usage and adjust to user needs. Thus, the final 
distributed database will consist of a set of NODES, each 
with unique CATALOG and USER databases representing one 


entry in the DIRECTORY. 


The DIRECTORY is resident at each node in the network and 
contains one entry for each node in the network. Each node 
is defined by a set of attributes which specify the types 
of holdings at that node. These attributes include the 
number of data objects available at the node, the 
geographic and scientific data emphasis of the node, and 
the principle collections which are present at the node and 
their format. By querying this database the user can 
identify that node or set of nodes which may contain 
relevant data sets for his or her specific application. 

The CATALOG of each database is unique to the given node, 
and is a detailed inventory of the holdings of that node. 
Standard descriptive attributes about each image in the 
database can then be used to select specific images which 
are appropriate to the users needs. These attributes 
include: date and time of acquisition, platform, sensor, 
band, cloud cover, sun elevation, and any processing 
history which may have been performed on the data. In 
addition, the actual form of the data (ie. printed, film, 
or digital) at the node is included. 


Top level low level joining categories 

Directory geographic coverage principle collections 

Catalog scientific discipline scientific focus (node) 


User sensor type 

band 

platform 

image parameters 

location 

map reference 

data format 


geographic emphasis 
sensor bands 
platform sensors 
discipline focus 
scientific focus (user) 
data emphasis 
nodes queried 
sensors queried 
geographical areas 


Table 1. table outlining the logical information 
content of the BROWSE database. In the actual 
database schema several of the low level and joining 
catagories are shared by the Directory, Catalog, and 
User databases. 

The USER database performs two main functions within the 
BROWSE system. First, this database contains information 
regarding remote sensing applications scientists, and their 
scientific and geographic specializations. Secondly, this 
database is used to perform system usage monitoring. In 
this way, users can obtain information regarding other 
scientists involved in related research. In addition, the 
system logging information can be used to evaluate system 
performance and design strategies for adjustment in future 
systems . 


DATA ACCESS 


The discussion to this point has emphasized implementation 
aspects of the BROWSE system. This is a reflection of the 
fact that to this date the majority of the research efforts 
have been devoted to system design and implementation. 
However, the ultimate emphasis of this project is to 
provide a remote sensing information system for 
applications scientists to aid in data set selection. The 
database and associated software are now in place and the 
system is currently in its initial phase of operation. 

As previously indicated, . a group of remote sensing 
applications scientists has been targeted to use and 

evaluate the system. Using the workstation software, these 
users may access the database using standard telephone line 
communication protocols. A toll free number has been 

established for this purpose. Using these facilities the 
user can query the database, down-load image data to his or 
her workstation, and display that data locally. 

A typical session would first involve the initial dial up 
and login procedure. After accessing the BROWSE system, 

the user would then be prompted with a set of menus which 
can be used to specify attributes for a query of the 

database. The specific geographic area of interest is 

identified interactively by the user by graphically 
identifying a window on a map plotted on the workstation 
screen using a pointing device. The user is then prompted 
for more scene specific information. For example, a 
climatologist interested in land surface energy balance 

research in the Santa Barbara area might enter the 

following attributes: 

Database of interest: CATALOG 
platform: all 
sensor: all 

begin date : iune 1, 1979 
end date: sept 1, 1979 

to which the system might respond with information of the 
following nature: 

BROWSE database: 2 items found 

1. Heat Capacity Mapping Mission, 

July 3, 1979 

3 bands online, 600m spatial resolution 
scene id # A-A0062-16032-2 

2 . Landsat-3 , 

August 15, 1979 

4 bands online, 79m spatial resolution 
Scene id # 8246517502500 

For more information please enter the scene id or ESC to 
escape: 

If any of the items found are of interest to the user then 



more detailed information (sensor specifications, 
resolution, number of lines and samples, sun angle, etc.) 
can be obtained by entering the specific scene id. In this 
way, the user can 'browse' the image database, and identify 
potentially useful data sets. If an appropriate image is 
found, the user can then exit the database and down-load 
the image data to his or her workstation. The image 
display software can then be used to view the actual image 
data. Therefore, the user has been provided with much 
valuable information regarding the acquisition of the 
actual data set for a specific application. 

CONCLUSIONS 

Superior data management and information system technology 
is required to efficiently manage the immense data volumes 
which are projected for EOS era remote sensing systems and 
other geo-referenced data sets in the 1990 's. Included in 
this domain are issues related to spatial database 
management, distributed database and networking 
technology, data archiving strategies, and information 
systems technology which maximize user accessibility to 
data sources. The BROWSE system has employed rapid 
prototyping methodologies to implement and evaluate 
proposed solutions to some of the above problems. In 
particular, BROWSE has specifically examined questions 
pertinent to the access of spatial databases by users from 
remote locations. Issues which have been examined include 
database design, user accessibility and remote 'browsing' 
of image display on commonly available hardware.. 

To date, a prototype database has been designed and 
populated, and workstation software has been developed 
which allows a user to query the database and view image 
data from his . or her office. The system is currently in 
its first phase of operation and is being used by a group 
of remote sensing applications scientists in order to 
evaluate system performance and features. After a 
preliminary evaluation period the system will be enhanced 
by providing linkages to other data centers and by 
providing an 'intelligent' user interface to aid the user 
in using the system. In this manner, valuable insight will 
be derived towards the development of geo-based information 
systems for remote sensing data in the EOS era. 
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ABSTRACT 

With the mass of remotely sensed data being collected, researchers are often 
unaware of what data is available and where it is stored. Data is frequendy pur- 
chased on the basis of a terse database record describing, for instance, cloud 
cover and image quality. A better approach may be to allow analysts to preview 
spatial data, not just some salient facts about that data. 

In a NASA-sponsored research program, the authors are developing a proto- 
type system with browsing capability for data archives. The objective is to allow 
scientists, sitting at their local workstations, to access a network, to retrieve 
records of image and spatial data selected by user-specified attributes, to view 
low resolution versions of the data, and to place an order, if the data are satisfac- 
tory. The architecture and functioning of the BROWSE testbed is briefly out- 
lined. 

Surprisingly, the definition of BROWSE has found no consensus. This 
paper focuses on the on-going efforts to establish a definition that satisfies a 
broad range of scientific disciplines. Any operational definition will include the 
algorithms used to compress the raw data into a low resolution format. Prelim- 
inary algorithms being considered include single band subsetting, spatial subsam- 
pling, band ratios, principal components analysis, and linear combinations. 
Several browsing scenarios illustrate the complexities of selecting suitable data 
sets. An effective browsing utility will have benefits beyond NASA data systems 
and the Earth Observing System. Lessons learned from this project may be of 
value to other spatial data base designers. 


1. INTRODUCTION 

Effective information systems are an increasingly important aspect of spatial 
data analysis (Estes, 1985). Technical and social innovations provide a forcing 
function for improvements in the design of information systems, driven by the 



increasing demand for and generation of scientific data. Naisbitt (1982) discussed 
several social transformations that characterize corresponding changes in how 
geographical analysis is being done. Among these trends are shifts from: 

• centralization to decentralization 

• hierarchies to networking 

• an industrial to an information society and 

• forced technology to high technology and ‘high touch’, meaning increased 

personal involvement. 

The National Aeronautics and Space Administration (NASA) has recognized 
these trends in establishing pilot data systems for ocean, climate, land, and plane- 
tary science data (Jet Propulsion Laboratory, 1986a; NASA Goddard, 1986; 
NASA, 1986; Jet Propulsion Laboratory, 1986b). Working groups are also 
designing information systems for the vast quantity of data projected from the 
Earth Observing System (EOS) on the Space Station complex in the coming 
decade. The challenge is twofold. The first challenge lies in developing data 
handling techniques for the anticipated large data volumes. The second challenge 
is to bring this new technology to a broader scientific and applications consti- 
tuency in the service of global science (Estes et al, 1986). 

The Committee on Data Management and Computation (CODMAC) of the 
National Academy of Science predicts that an important capability in future spa- 
tial information systems will be the ability to browse through databases from a 
remote terminal (CODMAC, 1982). Browsing could perform three important 
functions. It would allow users to locate and preview spatial data and make an 
informed decision about their utility, for instance, as input to a geographic infor- 
mation system (GIS). Secondly, it would provide a mechanism for the user com- 
munity to view newly received data and make recommendations on whether the 
data are suitable for permanent archiving. A third function of interactive brows- 
ing would be to aid decisions regarding what additional observations to acquire 
during an ongoing mission (CODMAC, 1986). At NASA, this mode of operation 
is being called telescience. However, different disciplines do not necessarily 
share common ideas on optimal browsing formats and characteristics. This paper 
describes attempts to define ‘browsing’ in the context of multidisciplinary spatial 
data systems. This research is part of an on-going project at the University of 
California, Santa Barbara Remote Sensing Research Unit, funded by NASA 
Headquarters. 

A brief overview of the BROWSE testbed facility at UCSB is provided. (A 
more complete description can be found in Star et al, 1987). Next, some tenta- 
tive algorithms for generating browseable data suitable to different disciplines are 
proposed. In addition, several scenarios for browsing, suggested by our scientific 
collaborators, are outlined to suggest the complexity of reaching a consensus in a 
definition. Then some future research directions are discussed. The main focus 


of next year’s effort will be in the evaluation phase. Our hope is that public dis- 
cussion of these issues will not only improve the effectiveness of the BROWSE 
testbed, but will also stimulate creation of browsing utilities on other spatial 
information systems as well. 


2. DESCRIPTION OF THE BROWSE TESTBED FACILITY 

The first year of research, recently ended, saw the development of a rapid 
prototype of a browse utility at UCSB. Our objective was to get an operating 
version on-line and accessible via phone line to a set of collaborators from many 
disciplines and institutions around the United States. In the current year, we plan 
to incorporate their feedback on the usefulness of the system for their respective 
needs. What follows is a very brief description of the testbed facility. 

The host facility consists of a MicroVAX II Workstation with peripheral 
tape drive, two 319 megabyte hard disks, a VT260 terminal, and a PC- AT as a 
workstation. Resident on the host computer are the database management system 
(DBMS), the user interface program, two databases, and a set of preprocessed 
browseable images. The DBMS used is a public-domain package, Relational 
Information Management (RIM), also being used by the NASA Ocean Data Sys- 
tem (NODS) operated by the Jet Propulsion Laboratory. 

To save the user from having to learn RIM syntax or the details of the data- 
bases, we have written a menu interface that prompts the user for query attributes. 
The Catalog data base contains detailed records of individual images. As seen in 
the Catalog menu in Figure 1, the user can restrict the search to specific geo- 
graphic area, sensors, dates, maximum cloud cover, and browseable status. The 
graphic mode for identifying a geographic search space is shown in Figure 2, 
using Santa Barbara, California as an example. The program then automatically 
formulates the appropriate database query. The current query attributes or the 
current set of records retrieved can then be displayed. Figure 3 shows an exam- 
ple of records retrieved for the same area as shown in Figure 2. Realize that at 
this point in the session, only information about the data is provided, not imagery. 
This aspect is similar to a computerized literature search at a library or the type 
of query the EROS data center has supported for several years (USGS, 1980). 

When a Catalog database record sounds useful, the user invokes KERMIT, a 
public domain communications package, to transmit the preprocessed image to 
their local terminal. This low resolution version can then be displayed locally 
using an image display program written for the IBM PC-AT workstation. Using 
multiple windows, the display includes a single band image, its histogram, and 
statistics. Currently, the display uses a level slicing routine to divide the grey 
values into eight colors, which can be adjusted through contrast enhancement 
functions in the program. Figure 4 shows a black and white reproduction of the 
BROWSE image corresponding to the database record of Figure 3. At this point. 
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the user can make a more informed decision on the suitability of the data for a 
spatial data processing application than from the data record alone. What makes 
BROWSE unique is the capability of both locating and displaying spatial data at 
a remote terminal. 

An alternative menu path guides the user through the Directory database 
containing addresses and collection emphasis of various data centers around the 
world. Therefore, if suitable imagery is not available in the relatively small col- 
lection at the UCSB Remote Sensing Research Unit, the user can be directed to 
other likely sources. We plan to add Catalogs for some of these other archives to 
the BROWSE network. A third database of user information is planned for the 
coming year. This User database will permit a researcher to find other investiga- 
tors who are knowledgeable about particular geographic regions, sensors, or dis- 
ciplines. 


3. PROPOSED BROWSING FORMATS 

From the description of the testbed facility, browsing in this context can be 
very generally defined as locating spatial data and viewing it in some graphical 
form. This is not unlike how a researcher browses in a library— going to the shelf 
of relevant books, selecting some likely candidates, looking at the abstracts' (i.e., a 
low resolution version of the entire text), rejecting some, finding new suggestions 
in the list of references, and often making serendipitous discoveries in material 
they were not originally aware of. It is this sense of discovery and exploration 
that can add Naisbitt’s sense of ‘high touch’ to the high technology of computer- 
ized data retrieval. 

Each of the NASA pilot data systems are implementing some form of 
browse capability. NODS includes a browse facility with data tabulations, graph- 
ics, and image formats. The user chooses a format and is presented with a list of 
names of browse files to view (Jet Propulsion Laboratory, 1986a). These files 
contain preprocessed examples of each format. Browse images of 4500 Coastal 
Zone Color Scanner and Advanced Very High Resolution Radiometer scenes for 
the West Coast Time Series have been recently put on line. At the Pilot Climate 
Data System, browsing includes looking at image data, but it also refers to look- 
ing at descriptions of sensors and climate parameters (NASA Goddard, 1986). 
Browsing or ‘quick-look’ support is also specified in the functional requirements 
for the Pilot Land Data System. Significantly, browsing is included under the 
data management requirements rather than those for analysis software (NASA, 
1986). Recognition of this role underscores the point that browsing is a function 
to aid in data selection and management and should not supplant or duplicate GIS 
or image processing analytical functions. The Planetary Data System, is using 
videodisks for electronic browsing of planetary images (Jet Propulsion Labora- 
tory, 1986b). A browse utility for EOS data is also specified (Broome, 1986). 
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Other data systems outside NASA are also providing browsing functions (Kubo, 
1986). However, little detail of what these ‘quick-look low resolution’ views 
should show is described in any of these systems. 

Among our objectives in the BROWSE project is to determine if there is a 
basic set of processing algorithms to generate suitable browse images. At the end 
of the project, we intend to provide NASA with a matrix of algorithms preferred 
by each discipline and suggest a small subset of these algorithms that present data 
in ways the majority of researchers require. The challenge is to satisfy scientists 
in fields as diverse as oceanography, forestry, and meteorology. The difficulty 
can be illustrated by considering clouds as viewed by a spacebome multispectral 
scanner. To many specialists, clouds are a hindrance to information extraction. 
Therefore, extensive cloud cover is a negative attribute to be avoided or masked 
from the data. For some meteorological applications, however, it is precisely the 
clouds that are significant. 

Providing low resolution browse data presumably involves compression in 
the spatial, spectral, radiometric, or temporal domains, i.e., subsampling the origi- 
nal data. Data compression offers several benefits in an information system and 
one major drawback. On the positive side, compression reduces the data storage 
and transmission loads on the system. Although storage capabilities are increas- 
ing (and costs decreasing) in the computer industry, the impact of EOS’s pro- 
posed 262 channel High Resolution Imaging Spectrometer (HIRIS) will still be 
enormous in the foreseeable future (Broome, 1986). The projected data rate from 
HIRIS translates into one 6250 bpi density tape every second. The disadvantage 
of compression is that, after a certain level of reduction, the process is irreversi- 
ble. Just as an abstract cannot be transformed into the original book, a highly 
compressed image cannot be converted by the user into the raw data. In other 
words, BROWSE only supplies the abstracts, as it were, so that the user can 
decide whether to buy the whole book. 

The simplest approach to image compression is to discard all but a single 
spectral band. For purposes of browsing, a single band is often adequate for 
determining whether image data is suitable for input and analysis in a GIS. A 
visible wavelength band is appropriate for revealing the effects of clouds and 
atmospheric disturbance, for determining the presence of man-made objects, or 
for penetrating water surfaces. Other wavelengths have their own best uses. 

A second approach capitalizes on the known relationships between spectral 
bands by using ratios of two bands. A ratio of reflective infrared over red 
wavelengths is commonly used in vegetative studies. The relative brightness is 
an indicator of biomass (Jackson, 1984). Geologists frequently use a mid infrared 
over near IR, a mid-IR over another mid-IR, or a red over blue band ratio to 
determine composition of surface materials (Crippen et al, in press). 

Principal components analysis takes advantage of correlation in a dataset to 
transform it into a smaller number of dimensions. Significant scene variance may 
be conserved in a few synthetic output channels (i.e., components), while 



uncorrelated noise is collected in lower order components that can be discarded in 
many circumstances. We hypothesize that a single component will show enough 
information for a browser. The disadvantage of principal components analysis is 
that it can be very time consuming to compute. The transformation coefficients 
must be recalculated for every image, and the components may be difficult to 
interpret since they are scene dependent 

Linear combinations are . another way to transform image data, using a 
predefined set of coefficients. Each combination makes a specific rotation of data 
in spectral space to optimize a particular feature. Perhaps the best known of this 
type is the Tasseled Cap transformations for Landsat Multispectral Scanner data 
emphasizing soil brightness and vegetation greenness (Kauth et al, 1976). A 
similar transformation has been developed for Thematic Mapper data (Crist et al, 
1984). In essence, the first transformed axis is an ordination from wet to dry soil, 
while the second axis indicates biomass as a distance from the bare soil baseline. 
Coefficients can be easily derived for other sensors but the interpretation of the 
axes may be unique for each type (Jackson, 1983). 


4. SOME BROWSING SCENARIOS 

It is impossible to predict all the ways researchers will want to use the 
BROWSE facility. The collaborators who have assisted us by describing their 
browsing needs tend to be experienced in remote sensing applications. The new 
constituency NASA hopes to reach is harder to identify, and new users may have 
difficulty articulating what they need, other than that the system be helpful and 
flexible. This section describes a few of the scenarios which our multidisci- 
plinary collaborators have identified as the process they might use in gathering 
GIS data. 

As we progress into the age of global science, researchers are collecting 
datasets for large regions at high resolution. An analyst working at this scale 
might need to see a low resolution, single band view of many adjacent scenes 
acquired in a particular season by the same sensor under relatively clear atmos- 
pheric conditions. 

Research projects sometimes involve multistage sampling, with low resolu- 
tion data of large areas and higher resolution for selected samples. A browse 
facility could determine if data of all required types were available and suitable at 
the study site. Alternatively, when multisensor data is required, the DBMS could 
identify sites with all the required types. By browsing through these sites, the 
analyst could select the most suitable for further investigation and commitment of 
staff and funds. 

Discipline specialists new to remote sensing may not know the capabilities 
of different sensors. In an extreme example, an investigator wanting to census 
elk populations over a large region may believe that a single scene of a low 



resolution sensor like the Advanced Very High Resolution Radiometer (AVHRR) 
would be adequate. A quick-look at a browse image should convince them 
immediately that AVHRR data in not appropriate for the task. 

Misplacing research data over time as students come and go is an embarrass- 
ing fact of life in many academic departments. It is hopeless, as we ourselves 
have discovered, to go through a storeroom full of poorly labeled tapes, trying to 
find a valuable dataset someone in the past has painstakingly assembled. If a 
browse image of each dataset were available, along with a trail to the dataset 
itself, researchers might spend less time recreating data. 


5. FUTURE DEVELOPMENT PLANS 

The first year of development focused on rapid prototyping of the BROWSE 
testbed facility and exploring what ‘browse’ means to different disciplines. The 
existing prototype consists of a single node at UCSB with a relatively small data- 
base. In the coming grant year, we intend to expand the system to include at 
least one or two additional nodes with different types of image data. To accom- 
plish this involves developing a high level activity manager that queries the 
appropriate databases around the network, transparently to the user, so a person 
does not need to learn multiple query languages. 

The prototype facility has now become accessible via standard phone lines 
to our collaborators. We will be monitoring their use and incorporating their 
feedback on the user interface and the types of browse formats. In concert with 
this, we will be examining data compression in greater detail. Tradeoffs between 
storage overhead of preprocessed image data versus time delays of processing on 
demand will be assessed. 

The existing prototype assumes the user can select appropriate sensors and 
will recognize when data are suitable. Since NASA hopes to establish a wider 
constituency for data acquired in space, more expert guidance will be necessary 
to aid new users in formulating appropriate queries. Development of an expert 
system is planned that can give intelligent advice on query attributes. The user 
need respond only with the parameters of their task i.e., the view that is more 
familiar to them. The initial expert enhancement will be in the domain of vegeta- 
tion applications where a substantial knowledge base can be assembled from the 
research literature. 


6. CONCLUSIONS 

A capability to browse spatial data will be an important function in future 
spatial data systems. Researchers will browse in order to locate suitable input 
data, to recommend imagery for permanent archiving, and to request additional 
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data acquisitions from ongoing missions. When EOS begins generating massive 
volumes of data in support of global science, the need for these three functions 
will be even greater. The BROWSE project outlined here indicates one effort by 
NASA headquarters to prepare for the EOS era. 

As defined above, ‘browsing’ involves both locating and viewing spatial 
data. The browse function is most appropriate as a data management tool. We 
do not see it as a replication of existing GIS or image processing analytical func- 
tions. Several methods of compressing data into browseable format have been 
discussed. Additional methods, from disciplines different from those of the 
authors, may be identified during the testing and evaluation phase. Planned 
enhancements should make BROWSE more effective in serving a wider consti- 
tuency of users, which is one of NASA’s objectives. To be effective, a browsing 
function should also be fast, flexible, and friendly. 

Persons interested in joining the list of BROWSE users/testers are 
encouraged to contact the authors at the above address. Together we can give 
browsing for spatial data a degree of ‘high touch’ comparable to the level of high 
tech provided by computerized database management. 
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ABSTRACT 

Many scientists use geographic location as a fundamental attribute to 
identify interesting earth science data sets. The specification of this 
attribute in database queries is both tedious and prone to error. MAPWD is a 
graphical tool which simplifies this specification by allowing the user to point 
to areas of interest using an electronic map, instead of manually entering 
coordinate values. With this tool the user is able to identify regions of 
interest using maps with comparable detail at a wide range of spatial scales. 
This functionality has been achieved by hierarchically structuring the database 
of map vector data into separate files based on both geographic location and 
resolution. 


Introduction 

Earth science data sets are growing rapidly in both size and variety. Data 
archives are currently being compiled by numerous agencies at a wide range of 
spatial scales which are of a higher accuracy, resolution and volume than ever 
before. These data sets encompass a multiplicity of formats and data types 
including topographic, soils, climate, oceanographic, and hydrologic data to 
name but a few. An important issue confronting earth science is the efficient 
management of these scientific data sets. 
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The development of an efficient earth science data management system 
encompasses a diverse range of issues including: 

- database design parameters 

- optimal strategies for data archiving and deletion 

- superior user access to distributed data archives 

- conformal formats for data exchange and accuracy standards. 

In this paper we describe a program called MAPWD. This program comprises one 
component of a user friendly interface to spatially referenced data cataloged in 
a relational database management system. The goal of this interface is to aid 
the user in specifying queries to the underlying database. 

A fundamental requirement of queries to databases of spatially referenced 
data is the efficient specification of search constraints based upon geographic 
attributes. The purpose of MAPWD is to simplify the specification of locational 
attributes for queries to the database. This interactive spatial query system 
allows the user to specify a geographic region of interest by pointing to 
locations on a virtual map that is displayed on the workstation rather than 
manually entering the corresponding geographic coordinates. 

MAPWD is written in Fortran77 using a Digital Equipment Corporation (DEC) 
implementation of the graphics kernel standard (GKS) graphics programming 
standard (level lb) to implement the various graphics necessary for this type of 
utility (DEC 1987). The MAPWD program represents one component of a larger 
system entitled "BROWSE". BROWSE is designed as a testbed for implementing and 
evaluating strategies for improving user accessibility to data archives (Star et 
al., 1987). The BROWSE data management system for which MAPWD was developed is 
specifically designed to handle remotely sensed data. However, most of the 
concepts described in this paper may be generalized to any spatially referenced 
data set (i.e. maps, photography, and geophysical data). 
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Spatially Referenced Data Sets 

Most earth sciences data sets may be considered to be a subset of the more 
general category of geographical data. Geographical data are characterized by 
both spatial and non-spatial attributes which may be used to locate objects 
stored within the data base. Spatial attributes describe characteristics such 
as the geographic coverage of the data, while non-spatial attributes may include 
items such as the date of data acquisition and other data set specific 
information. Geographical data sets are defined to be spatially referenced in 
that the location of each object in the database is included using some fixed 
geographic coordinate system. In addition, topological aspects (e.g., 
adjacency, proximity, etc.) as well as descriptive attribute information may be 
recorded for any object. Indeed, it is the topological and spatial 
characteristics of geographical data processing systems that distinguish them 
from more conventional data management systems for applications such as banking 
or library systems (Burrough 1986). 

Database management systems (DBMS) have been the focus of extensive 
research and development in computer science. Current data models which have 
been developed for DBMS technology are highly sophisticated. However, the 
majority of this effort has been geared towards handling data which is non- 
spatial in nature. Consequently, in recent years there has been much interest 
among the geo-processing community in adapting existing DBMS technology to 
handle geographic data in a more natural manner (van Roessel 1987, CODMAC 1986, 
Lorie and Meir 1984). In particular, the relational data model (Codd 1970) has 
been employed for a variety of geographical data processing applications. 



Browse 


The BROWSE system (Star et al. 1987, 1988) is a prototype data management 
system designed to manage remotely sensed and other spatially referenced data 
sets. The system has been implemented on a Micro VAX II using a public domain 
relational DBMS called RIM (Relational Information Management, Johnson and 
Johnson 1985). This DBMS is equipped with a Fortran interface which we have 
used to develop a menu driven user interface to the underlying DBMS. The user 
interface is organized as a hierarchy of menus which guide the user’s selection 
of query attributes. The user is prompted for attribute information, and the 
query is formulated and performed using subroutine calls embedded in the Fortran 
source code. In this manner, the user is not required to have a knowledge of 
relational algebra, nor of a specific query language. 

A primary focus of BROWSE is related to examining techniques for improved 
user access to scientific data sets. Currently, numerous large scale scientific 
data sets are in various stages of creation, with spatial coverage varying in 
scope from regional to global (Mooneyhan, 1987). These data sets encompass 
numerous disciplines and are being archived at different locations using 
different digital storage systems and data formats. 

A key to the effective utilization of these highly valuable data archives 
is maximizing the accessibility of the scientific community to these data. As 
part of the solution to this problem the Committee on Data Management and 
Computation (CODMAC) has recommended that: 

"NASA evaluate and develop the concept of an ’electronic 
browse’ capability, allowing users to explore data files 
via communication links. We recommend that ’quick-look’ 
low resolution data be included for use in electronic 
browsing" (CODMAC 1982). 
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Capabilities such as those specified above are the focus of the BROWSE project. 
Running on workstations which emulate Tektroniks 4014 displays, the MAPWD 
routine embedded within the BROWSE system represents one specific case of 
improving the ability for researchers to efficiently access earth science 
databases. Other techniques for improving the utility and general applicability 
of the BROWSE system such as a quick look image display capabilities and object 
oriented search strategies have been developed or are in planning (Wright and 
Friedl 1987, Stoms et al. 1988). 


MAPWD 

A fundamental attribute of earth science data sets is location and/or areal 
coverage. MAPWD is a tool which allows users to specify geographical regions 
(windows) of interest in a database query by pointing to them on an electronic 
map displayed at their workstation. This program is implemented as part of the 
user interface to the BROWSE system. However, the general concepts could be 
easily modified and the source code adapted to any database of spatially 
referenced datasets. The program is written in DEC Fortran77 and requires a 
Tektroniks 4014 class graphics workstation, or one equipped with emulation 
software. A complete set of the DEC GKS run-time libraries (level lb) is also 
necessary to run the software. A complete source code listing is provided in 
appendix A 

The user is able to iteratively refine the spatial window specification by 
successively zooming into and away from a region of interest. When the user is 
satisfied with the window selection, the geographic coordinates are passed to 
BROWSE as part of a query specification. We believe this to be a superior 
approach to manually typing in coordinate values using the keyboard. Using a 
conventional relational database system, a user would specify a geographic 
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region of interest by manually entering the coordinates for the window 
containing that region. For example, using a SQL (structured query language, 
Chamberlin et al. 1976) based query language the general form of this query 
would be: 


SELECT sceneid FROM images 
WHERE NWJat It 40 
and NW_long It -120 
and SE_lat gt 30 
and SE_long gt -118 

This query would find those scene identification numbers of images which lie 
within the geographic window with north west comer coordinates of 40N 120W and 
south east comer coordinates of 30N 118W. Specifying such a query is both 
tedious and prone to error because the user is forced to manually access atlas 
type products, estimate coordinate boundaries for study areas, translate these 
values into the particular coordinate system of the database, and finally 
include these values in a database query. 

In contrast, MAPWD provides the BROWSE user with a simple and explicit 
method of selecting geographic areas of interest. The MAPWD display consists of 
four window areas (fig. 1): 

1) a reference map window, 

2) a zoomed map window, 

3) a menu window, 

4) and a display information window. 

MAPWD is called from within BROWSE. To use the system, a user selects the menu 
item in BROWSE corresponding to "select geographic area." The user then 
identifies a general region of interest within a map of the globe and is 
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presented with a base map of that area. Using a mouse or keystrokes, the user 
defines a rectangular window of interest on the base map by pointing to the 
desired window corners. The zoomed display window is then rescaled and plotted 
with only those features located within the specified window. Features outside 
of that window are clipped (fig. 2). When the final area of interest has been 
defined, the coordinates are returned to the BROWSE main program and included as 
part of a relational database query. 

The algorithm for efficiently performing this procedure consists of simple 
graphic manipulations using hierarchically organized data components. Vector 
files containing the map data at several levels of resolution (i.e., detail) are 
stored on-line in disk files. Based on the window specification a set of vector 
files is selected and plotted. The vector files are in latitude/longitude 
format and are stored in binary form to reduce space requirements and improve 
I/O performance. 

The zoom mechanism which allows the user to iteratively refine a specific 
geographic window is simple to implement within a GKS graphics environment. 
Functions for both ’picking’ map coordinates and automatic clipping are built 
into this graphics standard. A more difficult problem is to develop an 
efficient method for plotting the vector files at various levels of resolution. 
Stated in another way, it is necessary to have vector files at several levels of 
resolution which may or may not be plotted depending on the map scale to which 
the user is zoomed. Consider, for example, the case of North America. At the 
continent level it is both unnecessary and inefficient to plot U.S. county 
boundaries. At the same time, if the display is to provide useful geographic 
information for orientation, it is necessary to include this level of detail 
when examining regions with the areal extent of only a few states. 

A solution to this problem was achieved by hierarchically structuring the 
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vector database into different files and assigning a zoom level to each file 
(fig. 3). In addition, at each zoom level the vector database is segmented 
geographically into separate files. Such segmentation of a cartographic 
database is referred to as "tiling" (Burrough 1986). The zoom level and 
geographic bounds for each vector file are stored in a lookup table for each 
continent. For any given window selection, a table lookup is performed and 
those vector files which are at the appropriate zoom level and which are within, 
or intersect, the geographic window selected are plotted. In addition, a point 
file of cities and major physiographic features is plotted in a similar manner. 

In this way, the plotting procedure is greatly optimized compared to 
storing one high resolution vector file which is plotted at all scales. Also, 
this format allows the database of vector files to be dynamically updated 
without having to update the actual source code. Instead, vector files may be 
updated, added, or deleted by simply modifying the lookup table resident in a 
disk file. This concept may also be extended to include different types of 
thematic data at different levels of resolution depending on the specific 
scientific domain and geographic area of interest. 

The alternative to this type of approach is to use line generalization 
techniques (Douglas and Peuker 1973). Line generalization algorithms modify the 
degree of precision in vector representation dynamically at run time. A simple 
example of this type of approach would be to systematically exclude points from 
the same vector file at successive zoom levels. However, this type of algorithm 
is less efficient, more difficult to implement, and prone to distortion effects 
(Monmonnier 1982) relative to the multi-resolution vector file approach 
implemented in MAPWD. 

A key utility of the MAPWD approach to specifying geographic windows is 
that it is adaptable to searching for data sets at a wide range of space scales, 
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ranging from highly local to global in scope. For example, a micro- 
climatologist may require data for a region of less than 100 square kilometers. 
Concurrently, an investigator interested in earth system science investigations 
may wish to search for data on a continental or even global scale. In this 
context, MAPWD allows the user to specify windows using maps of comparable 
detail at effectively any resolution. The technique employed by MAPWD also 
allows for improved multidisciplinary access to a database, as lookup tables may 
selectively display vector files which contain information which is relevant to 
a specific field of study. For example, a researcher searching for data on 
regional vegetation will be presented with a map of physical and political 
features of continents, while an oceanographer would be presented with a map of 
bathymetric features. 


Conclusions 

The development of techniques for improved access to the vast collection of 
earth sciences data sets anticipated in the 1990’s will be of great importance 
for advanced earth science research in the near future. While only part of a 
larger effort to develop these capabilities in the BROWSE program, the MAPWD 
spatial query system represents a valuable method of simplifying and improving 
the capability of identifying relevant earth science data sets. This is 
achieved by the interactive specification of geographic bounds using electronic 
maps at a variety of scales. The format of data storage and presentation within 
MAPWD allows for rapid and flexible spatial data querying for users from 
numerous backgrounds conducting studies at any number of scales. Any readers 
interested in using the BROWSE system and associated MAPWD software should 
contact any of the authors through the Department of Geography at UCSB. 
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Captions for Figures 


Figure 1. MAPWD display layout consisting of multiple windows. 
Clockwise from upper left: 1. menu window; 2. reference map 
window; 3. display information window; 4. zoomed map window. 


Figure 2. Sequence of map zooming operations to specify 
geographic window in southern California. Note variable 
resolution and changing degree of detail as iterative 
zooming is performed. 


Figure 3. Schematic representing hierarchical structuring of 
vector database. Multiple vector files are stored with 
variable degrees of resolution or detail. At any specific 
zoom level, vectors are segmented geographically using a 
technique known as tiling. A lookup table is used to plot 
appropriate vector files based on the zoom level and window 
of the area selected by the user. 
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Browse in the EOS Era 


Meeting Attendees 

The following is the list of attendees at the team meetings we have sponsored during 
the first contract year. 

Frequent irregular meetings through the year at UCSB, for local faculty: 

David Simonett 
Raymond Smith 
Earl Hajic 
Frank Davis 

Spring meeting at UCSB - May 1987 
Dr. Ron Blom, JPL 
Mr. Jim Brass, ARC 

Dr. Supriya Chakrabarti - Space Sciences Lab, UC Berkeley 
Mr. Len Gaydos, USGS Menlo Park 
Dr. Thomas Logan - JPL 

Prof. Doug Stowe - Geography, San Diego State Univ. 

Spring meeting at GSFC - June 1987 

Prof. Peter Cornillon - Oceanography, Univ. Rhode Island 
Prof. Bob Ragan - Civil Engineering, Univ. Maryland 
Dr. Nick Faust - Georgia Tech Research Inst., Atlanta 
Dr. Forest Hall - GSFC 

Dr. Paul Smith - GSFC (now at NASA Headquarters) 

Winter meeting at UCSB - December 1987 
Dr. Bob Chase - Univ. Colorado, Boulder 
Dr. Dick Collier - Purdue Univ., NPIRS 
Mr. Ralph Dedecker - SSEC, U. Wisconsin, Madison 
Prof. John Jensen - Geography, Univ. South Carolina 
Dr. Tom George - Geophysical Institute, U. Alaska, Fairbanks 
Dr. Nick Faust - Georgia Tech Research Inst. 
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Internationa! Meetings 

During the past year, the University of California at Santa Barbara (UCSB) Infor- 
mation Sciences Research Group (ISRG) personnel have become increasingly involved 
in activities associated with the United Nations Environment Programs (UNEP) Global 
Environmental Monitoring System (GEMS) Global Resource Information Database 
(GRID) activity. 

During September 1987, Dr. Star participated in a meeting held at the United 
Nations Environment Programme offices in Nairobi, Kenya. Particular emphasis at this 
meeting was on the information systems support that the GRID program will require as it 
becomes operational over the next several years. The invitation to join this working 
group came from Dr. Harvey Croze, GRID coordinator. 

At this meeting, the working group reviewed the history, status, and future objec- 
tives of the GRID program. The pilot program, which began in 1984, was based on 
software and hardware donated or loaned to UNEP by government and private organiza- 
tions in Europe and the United States. After two years of the pilot program, a decision 
was made by the UNEP in the summer of 1987 to begin an implementation phase for 
GRID. This next phase is to begin in early 1988 and continue for two years. Our meeting 
in September 1987 came out of this decision; we were to provide guidance for the start 
of this next phase. 

A second element of our discussions in September regarded long-term data 
management issues. At this time, there is a relatively small volume of data in the GRID 
system. Plans must be made to begin to develop ways to manage the future large data 
volumes, as well as provide means for users to access both the data itself as well as 
descriptive information about the data. I made a slide presentation to both the GRID staff 
and the working group on the BROWSE testbed activity, funded through NASA Head- 
quarters. The BROWSE testbed, at a conceptual level, provides input to their planning in 
several ways. First, it demonstrates a way to index spatial data holdings in terms of 
prescribed attributes (geographic coverage, date of compilation, thematic information, 
etc.). These attributes are then freely searched by the users to identify relevant datasets. 
Second, since the BROWSE testbed includes capabilities to store limited image datasets, 
the same system provides limited access to the data itself. This is particularly useful in 
the case of the UNEP/GRID program, since there are, at present, no methods for charg- 
ing users fees even to offset database duplication costs. A future system similar to 
BROWSE could provide electronic access to both data and data descriptions with no 
direct staff costs. Croze and Mooneyhan were extremely interested in our development. 

I have forwarded copies of the BROWSE user guide to them, and Dr. Croze has sched- 
uled a visit to UCSB to see the testbed in action and have further discussions with us. 

A related meeting was held in Kew, England, at the World Conservation Monitor- 
ing Centre. At this meeting, funded by the United Nations, the overall architecture of 
WCMC’s data and information system was evaluated, as a starting point for hardware 
and software system migration in the future. The BROWSE testbed was again a focus of 
discussion, as a prototype user interface for non-expert data system users. WCWM is 
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funded by the United Nations Environment Programme, International Union for the 
Conservation of Nature, and the World Wildlife Fund, and representatives of these 
agencies attended. 

Based on this first meeting, we have submitted a letter of interest to the United 
Nations Environment Programme, to begin to develop some of the elements of a distrib- 
uted data and information system for WCMC. UNEP staff indicated that they are inter- 
ested in locating funds for our participation in this effort from their sources, and we look 
forward to this new collaboration with UNEP and WCMC. 
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Ftecent Testbed Users 

The following records indicate some of the recent users of our system, based on their 
first date of use. These are in addition to our team of collaborators. 

Date: 7-OCT-88 Current Pacific Coast Time: 14:52:55 
User: J.B. Dalton 

Institution: NASA/Ames Research Center 
Discipline: Planetary Atmospheres, Geophysics 

Hardware: HDS 3200 model 30 (VT220,TEK40) Access: DIAL-UP/MODEM 

Date: 17-OCT-88 Current Pacific Coast Time: 12:02:58 
User: Matt Smith 

Institution: Marshall Space Flight Center 
Discipline: Atmospheric Science 

Hardware: IBM PS/2 80 Access: SPAN/DECNET 

Date: 17-OCT-88 Current Pacific Coast Time: 12:43:04 
User: Fred Waltz 
Institution: Eros Data Center 

Hardware: modgraph Access: SPAN/DECNET 

Date: 18-OCT-88 Current Pacific Coast Time: 12:28:36 
User: Dr. Tom George 
Institution: University of Alaska 
Discipline: Interdisciplinary 

Hardware: MAC II via SPAN Access: SPAN/DECNET 

Date: 26-OCT-88 Current Pacific Coast Time: 13:39:42 
User: Dr. Chester Richmond 
Institution: Oak Ridge National Lab 
Discipline: biology 

Hardware: microvax Access: SPAN/DECNET 

Date: 27-OCT-88 Current Pacific Coast Time: 09:50:48 
User: Prof. R. Chase 
Institution: Colorado University, Boulder 
Discipline: large scale hydrodynamics 
Hardware: sun Access: ARPANET/NSFNET 

Date: 4-NOV-88 Current Pacific Coast Time: 10:09:36 
User: Michael Goodman 
Institution: Marshall Space Flight Center 
Discipline: none 

Hardware: pc/at Access: DIAL-UP/MODEM 

Date: 8-NOV-88 Current Pacific Coast Time: 09:12:40 
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User: Prof. Ray Arvidson 

Institution: Washington University, St. Louis 

Discipline: geoscience 

Hardware: vax750 Access: SPAN/DECNET 

Date: IO-NOV-88 Current Pacific Coast Time: 14:40:01 
User: Joe Bredekamp 
Institution: Nasa Headquarters 
Discipline: info sys 

Hardware: vax Access: SPAN/DECNET 

Date: 14-NOV-88 Current Pacific Coast Time: 15:53:15 
User: Dr. Heiner Biesel 
Institution: Evans and Sutherland 
Discipline: 1 

Hardware: vaxstation Access: SPAN/DECNET 

Date: 16-NOV-88 Current Pacific Coast Time: 15:47:56 
User: Condor Recovery team 

Institution: California Department of Fish and Game, USFWS, Audubon 
Discipline: Ecology 

Hardware: microvax Access: SPAN/DECNET 

Date: 12- JAN-89 Current Pacific Coast Time: 14:16:36 
User: David Salisbury 
Institution: UCSB 
Discipline: public relations 

Hardware: vax Access: SPAN/DECNET 

Date: 19-JAN-89 Current Pacific Coast Time: 14:17:41 
User: Dr. John Good 

Institution: California Institute of Technology/IPAC 
Discipline: Astrophysics 

Hardware: pc/color/1200 baud modem Access: DIAL-UP/MODEM 

Date: 19-JAN-89 Current Pacific Coast Time: 16:23:45 
User: Tadas Sesplaukis 

Institution: California Institute of Technology/IPAC 
Discipline: astronomy 

Hardware: ibm/pc Access: ARPANET/NSFNET 

Date: 16-FEB-89 Current Pacific Coast Time: 10:39:32 
User: Peter Price 

Institution: Marathon Oil Company, Houston 
Discipline: Remote Sensing 

Hardware: AMIGA(IBM-PC COM) Access: DIAL-UP/MODEM 
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Date: 23-FEB-89 Current Pacific Coast Time: 12:28:41 
User: CHarles Martin 
Institution: Lockheed 
Discipline: Image Processing 

Hardware: MICROVAX II Access: SPAN/DECNET 

Date: 2-MAR-89 Current Pacific Coast Time: 15:11:17 
TEK 4010 terminal 
User: Dr. Harry Miles 
Institution: The Nature Conservancy 
Discipline: yes 

Hardware: vaxstation Access: SPAN/DECNET 

Date: 4-MAR-89 Current Pacific Coast Time: 12:16:20 
User: Hunter Poole 
Institution: Lockheed 
Discipline: remote sensing analysis 
Hardware: MICROVAX II Access: SPAN/DECNET 

Date: 7-MAR-89 Current Pacific Coast Time: 10:44:51 
User: Mr. T.M. Albert 
Institution: USGS Information Systems 
Discipline: none 

Hardware: vaxstation Access: SPAN/DECNET 

Date: 12-MAR-89 Current Pacific Coast Time: 13:37:36 
User: Fayne Sisco 

Institution: IBM Houston, Federal Systems Division 
Discipline: System Engineering 

Hardware: PS/2 Mod 70 Access: DIAL-UP/MODEM 

Date: 27-APR-89 Current Pacific Coast Time: 14:00:22 
User: Lance Goode 
Institution: Navigation Technologies 
Discipline: none 

Hardware: vax station Access: SPAN/DECNET 
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BROWSE User’s Guide 


Build 2 


1. Introduction 

1.1. Major Revisions in Build 2 

Based on feedback from BROWSE users, the following revisions have been 
made in the BROWSE software for this new version: 

1) The electronic map has been improved to make it display much faster. 
Also, for selected areas, as you zoom in, the map detail increases. 

2) Semi-automatic procedures have been added for transferring browse files 
with DECNET and TCP/BP protocols, in addition to KERMIT for dial-up mode. 

3) The term ’CATALOG’ has been changed to ’INVENTORY’ throughout the 
BROWSE testbed to conform with the new ESADS and DIF lexicons. 

4) Traps for erroneous user input have been added so the user won’t be 
dropped from the program. 

5) A Quit option has been added to the routine for displaying records from 
the INVENTORY database, so you don’t have to read long lists of records. 

6) Prompts and other screen text, have been clarified. 

7) More browse files from additional sensors have been added to the 
testbed. 

1.2. Purpose of Document 

This document updates the details on the use of the NASA BROWSE testbed 
facility at the University of California, Santa Barbara. Basic and applied 
research conducted on this project were made under the auspices of NASA Grant 
NAGW-987. BROWSE is designed to investigate the concepts that would allow 
scientists to locate image and other spatial data in distributed archives and 
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to preview that data from terminals in their offices. As such, BROWSE is a 
prototype and not a fully polished, operational information system. 


1.3. Scope of Document 


This document contains a description of BROWSE from the user’s standpoint. 
The underlying concepts of BROWSE are briefly explained in chapter 2, followed 
by an overview of the system components in chapter 3. Chapter 4 presents 
details on the use of each BROWSE component, including menus and example 
queries. 


1.4. Definitions 

Attribute: description of the column entries in a relation. 

Browse file: preprocessed, low resolution file of spatial data accessible 

through the INVENTORY database. 

Browsing: the process of locating and viewing spatial data from a remote 
terminal. 

Catalog interoperability: a capability of permitting a user to access 

different databases with different schemas and query languages through the 
use of only one of them. 

Database: an integrated collection of occurrences of a number of record 

types, where the record types and their occurrences are interrelated by 
various specific relationships. 

Directory: top-level index containing information about location, 

ownership, contents of databases. 

Image compression: a method of encoding digital image data with less 

redundancy to achieve greater efficiency of storage and transmission. 

Inventory: descriptions of a database in sufficient detail to retrieve 
subsets of data. Searchable by data fields or attributes, down to some 
level of granularity. 

Node: a computer facility connected by network to the BROWSE testbed. 

Record: a row of data in a relation (i.e., a tuple). 

Relation: a table of data in a database. 

Relational database: a database developed on the relational model; i.e., 
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based on tables. 

Schema: a description of the database that is compiled or translated into 

machine-readable form. 

Tuple: a row of data in a relation (i.e., a record). 

2. BROWSE Concept 

2.1. Definition of BROWSE Capability 

The purpose of the NASA grant to UCSB was to examine problems related to 
browsing of image data sets by scientists from many disciplines at widely 
distributed locations. One of the problems identified was to define exactly 
what is meant by "browsing". Generally, "browse" is being used to describe a 
process of locating image data and pertinent information about that data, as 
well as obtaining a quick-look (in some form) for a decision concerning 
acquisition from the host site. The quick-look might involve a compressed 
display (i.e., reduced spatial, spectral, or radiometric resolution) of the 
imagery, image histograms, or image statistics. The intent is not to make the 
full dataset directly available to users nor to duplicate image analysis 
capabilities. 

2.2. Need for Browsing 

BROWSE anticipates the high data volumes associated with Eos, the Earth 
Observing System on the Space Station Complex. The ability to browse Eos data 
and to select only that portion of the data relevant to a given need will be 
important. 

Others have noted the utility of browsing capability in the conduct of 
scientific research. The prestigious Committee on Data Management and 
Computation (CODMAC) of the National Academy of Sciences, in its goals and 
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priorities to maximize scientific return on our nation’s investment in space 
science recommended more emphasis be given to development of "electronic 
browsing capability, allowing users to explore data files via communications 
links". Consequently, they recommended that browsing use quick-look low 
resolution data (CODMAC, 1982; CODMAC, 1986). The issues concerning a browsing 
function, as well as descriptions of the BROWSE testbed, are presented in 
several conference papers written the RSRU staff (Star et al., 1987; Star et 
al., in press; Stoms et al., 1988). 

2.3. Pilot Data Systems and EosDIS 

Guidelines for the BROWSE testbed specified that it should consider 
multidisciplinary needs of the scientific community. Therefore, BROWSE is a 
prototype module that could be incorporated into any of the NASA pilot data 
systems for land, climate, oceans, and planets (PLDS, PCDS, NODS, and PDS) and 
into the Eos Data and Information System. The larger issues of data management 
in these information systems is being addressed by the pilot studies. BROWSE 
deals with the quick-look aspect that is common to all. 

3. BROWSE Subsystems - General Explanation 

This section gives an overview of the subsystems constituting the BROWSE 
facility. In terms of physical facilities, BROWSE consists of three components 
—a host facility at UCSB, the user’s remote terminal, and the communications 
link between them. At the host facility resides the user interface, the 
database management system (DBMS) with its databases, and a graphics program 
for selecting geographic area. The user’s terminal contains the image display 
and the communications software. An off-line image processing system is used 
to preprocess browse images. A project objective was for rapid prototyping so 
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existing public domain software was used where possible. Details about the 
operation of each component is described in chapter 4. 

3.1. Database Management System and Databases 

The relational database of data about available imagery is maintained at 
the host facility (in the future, each node will maintain a database of its own 
holdings). We selected RIM (Relational Information Management-JPL Version 4.5) 
for the DBMS. The NASA Ocean Data System at the Jet Propulsion Laboratory also 
uses RIM, which originated as a prototype at Boeing Commercial Airplane Company 
under authorization of NASA contract NAS1-14700. JPL added certain 
enhancements such as a multiuser capability and Fortran interface subroutines 
for relational algebra. RIM can be executed in standalone mode from a VAX/VMS 
prompt and has its own on-line help. For details on RIM, see "Relational 
Information Management (RIM) User’s Guide-715-604" (Jet Propulsion Laboratory, 
1985). BROWSE saves the user from having to learn RIM’s command language or 
the details of the database structure by providing a menu-driven front-end, 
coded in a Fortran applications program. BROWSE has two databases-one about 
specific images at the host node (INVENTORY) and another about data collections 
at specific nodes (DIRECTORY). 

3.2. User Interface System 

The user interface is a set of Fortran subroutines that prompt the user 
for the desired attributes of the imagery and formulate the appropriate 
database queries. This makes it easier for the user unfamiliar with either RIM 
or the specific database structure. The drawback of such an approach is that 
the queries are restricted to predetermined primary attributes. Consequently, 
flexibility is traded-off for ease of use. 
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3.3. Image Display System 


This software module, named IDS for Image Display System, is loaded into 
the user’s terminal from floppy disk. Currently this software is written only 
for IBM PC-AT with 640 kbytes of memory. Using a windowing environment, it 
displays the browse image that has been transmitted from the host to the 
terminal. Statistics and a histogram are simultaneously displayed with the 
image. Limited image processing functions, basically contrast stretching, are 
provided. The intent is on display, not image analysis. Therefore, only 
preprocessed imagery can be browsed. At this point, IDS is programmed to 
interpret headers from ERDAS files. ERDAS is a commercial image processing 
system. Any image display software available at the user’s local worksite can 
be substituted for IDS, if desired. 

3.4. Communications System 

Access to the BROWSE system is available over a regular telephone line or 
network access via SPAN/DECNET and ARPANET/NSFNET. Transfer of browse files is 
provided using these network protocols. For dial-up, terminal emulation 
software, such as VTERM and KERMTT, can be used to access the host node. Both 
of these software packages provide both the KERMIT protocol for transferring 
browse files and a TEKTRONIX emulator for the graphics program for selecting 
geographic area. 

3.5. Image Processing System 

The browseable image data on-line at the host facility has been 
preprocessed locally, using a commercial PC-based image processing system. 
Imagery has been subset from large data files, processed and often compressed, 
and transferred on magnetic tape to the host minicomputer. 
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3.6. BROWSE Facility Hardware 


Standard hardware has been selected for the testbed facility. The host 
facility at UCSB consists of a Micro VAX II workstation with approximately 700 
megaBytes of disk memory, a dual-speed 9-track tape drive, and a VT260 
terminal. Currently, we are using a Telebit Trailblazer Plus modem with up to 
9600 baud capability for dial-up access to the testbed. The computer is 
networked to an IBM PC-AT. 

3.7. Hardware Requirements 

The BROWSE testbed was designed to accommodate standard commercial 
hardware. For the user only interested in querying the database without using 
the electronic map or IDS, a PC with VT100 emulating software and a modem is 
adequate. To use the graphics and image display packages with an IBM-PC 
requires the addition of a Tek 4010/4014 emulator, cursor control, an EGA 
(Enhanced Graphics Adapter) board, and at least 640 Kbytes of RAM. Any 
terminal with Tektroniks 4010 capability can use the graphic mode of BROWSE 
(i.e., the electronic map). 

4. Detailed Description of the Use of the BROWSE Testbed 

This section describes the operation of the BROWSE components outlined 
above. It is written directly to the user in terms of procedures, prompts to 
expect, and general navigation through the system. 

4.1. Logging On to BROWSE 

There are three methods to connect to the BROWSE facility: 

1. From your local personal computer with a modem, dial 805-961-8409 using 
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VTERM. You will need a terminal emulator for VT100 and/or TEK4010. Our 
modem has 9600 (PEP protocol), 2400, 1200, and 300 baud capability. 

2. ForARPANETorNSFNETusers, type TELNETSBRSRU.UCSB.EDU. 

3. For SPAN/DECNET users, type SET HOST 45177. 

At the appropriate point, the system will respond with: 

SBRSRU - UCSB Remote Sensing Research Unit VAXstation II/VMS 4.5 

Username: 

Password: 

To these prompts, you should enter GUEST and J_DODGE respectively. Note that 
this password lets you query the database but not to do any irreparable damage 
to it or any other files on the computer disks. If someone else is already 
running BROWSE, you will see a message to this effect on your terminal. Please 
try again later. Normally, though, you will see some logon greetings. The 
Browse program runs automatically. First you will be asked what terminal type 
you are using. This is necessary for the program to provide appropriate 
control codes such as for clearing the screen. Next you will see the BROWSE 
welcome banner. The bottom line of the banner lists the current number of 
records in the INVENTORY database and the number of on-line browse files. Just 
hit a carriage return at the prompt to continue. Then you will be asked for 
your name, institution or agency, your professional discipline, computer type, 
and connection mode for our records. The connection mode is also used to 
select the appropriate protocol for browse datafile transfer. 

4.2 User Interface 

The user interface consists of a set of menus and prompts to guide you 
through the process of querying either of the two databases. From menus, enter 
the number of the option you want. Other prompts are relatively self- 
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explanatory. The main menu, shown in Figure 1, lets you choose to query the 
databases (the usual option). QUIT logs you out of the BROWSE testbed. HELP 
presents a menu of terms to choose from and then the associated definition for 
your choice. The HELP function is merely a prototype at this stage and only 
provides definitions of terminology. The MESSAGES option provides a forum for 
user feedback as well as update notes from the system developers. 


UCSB-BROWSE 


Build 

B Revised 

09/01/8B 


MAIN MENU 


1 . 

QUERY DATABASES 


a. 

BROWSE IMAGES 


3. 

HELP 


a. 

COMMENTS 8. MESSAGES 

5. 

QUIT BROWSE 



ENTER CHOICE 

NUMBER: 


Figure 1. Main Menu 

Normally, you will pick option 1, selected by typing the number and a carriage 
return. The next menu, QUERY, lets you select which database you want to 
consult. INVENTORY contains data about specific images and DIRECTORY generally 
describes the holdings at BROWSE nodes, whether on-line or off. The USER 
database would contain information about applications scientists who may be 
able to provide advice or collaboration on specific disciplines, sensor data. 
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or geographic regions. For the rapid prototype, the USER database is not 
implemented. 

4.2.1 Using the INVENTORY Database 

The INVENTORY menu presents the choice of attributes that you may use to 
restrict a query. While this limits your flexibility in manipulating data, 

hopefully the gain in ease of querying on the most commonly used attributes is 
a reasonable trade-off. These attributes include geographic area, 
platform/sensor, date of acquisition, cloud cover, and on-line status. Other 

options allow you to view the current values of query attributes (including 
defaults if unchanged) and the summary of the current set of records retrieved 
from the INVENTORY, or if desired, the complete record of each image retrieved. 
You can also save these records in a disk file for later viewing or printing. 

*** Warning*** In this version of BROWSE, you should only make one query 
based on each attribute. Each new query projects from the previous one. So if 
you try, for instance, to query on platform name "Landsat", followed by a query 
on platform name "Aircraft", the intersection will be an empty set. On the 
other hand, queries that narrow a previous search, such as a shorter date 
range, will give the expected results. 

Prompts for the values of query attributes are essentially self- 
explanatory. For platform name, you are presented a list of platforms. You 
simply enter the number associated with your choice. The "All" choice makes no 
reduction in the current set of records retrieved. Once you have chosen a 
platform, the choices for sensor are limited to those available on that 
platform, preventing you from trying to pick an AVHRR sensor on Landsat. (In 
the future, the sensor attribute may be modified so that you can specify a 
sensor type, e.g., "Microwave" as well as the specific name such as "SIR-A"). 
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The geographic area of interest can be specified in either graphic or 
keyboard mode, depending on whether you select TEK or VT100 mode at the 
beginning of the session. Graphic mode lets you use a mouse (or cursor arrow 
keys) to select a bounding rectangle on an electronic map display. First, you 
must select an option from the menu in the upper left corner of the display 
window by positioning the cursor on your choice and hitting the < RETURN > key. 
A ">" will indicate your selection. Usually, your first choice will be Zoom, 
to pick a specific region. Select the northwest and southeast corners of an 
area by moving the cursor to the desired location and press the < RETURN > key on 
the keyboard. The map will be redrawn, zoomed to the area you have delineated. 
This can be repeated as often as you like, and you can unzoom back to the 
previous map or to the globe if you are unsatisfied with the area drawn. As 
you zoom, increasing detail is added to the map (at least for selected regions 
to demonstrate the concept). See Figure 2 for an example of the electronic 
map. When you have the area you want, select option 4 from the menu which 
queries the INVENTORY based on your selected coordinates. The keyboard method 
in VT100 mode prompts you to type in the northwest and southeast coordinates 
from the keyboard in the exact format described on the terminal. 

For specifying acquisition dates, you have several options. You may 

select a start and end date of a range, a specific date, or a specific month, 
for instance, if you wanted June scenes of an area but the year was 
unimportant. 
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UCS8 BROWSE 


WORLD PL.OT 



I 

I 

I 

I 

I 

I 


Figure 2. Example of Electronic Map Selecting 
a Region around Santa Barbara, California 

The cloud cover attribute simply sets a maximum limit of the percentage of 
cover you will tolerate. Some scenes have an unknown cloud cover and are coded 
with a value of -1 in the INVENTORY database. Therefore, they will be included 
in all queries restricted by cloud cover but the user is cautioned about their 
significance. 
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When you have finished specifying attributes, or after any single query, 
you can examine the selected records from the INVENTORY by selecting option 7 
from the menu. A summary of each record is displayed, one screenful at a time, 
for ease of reading. You have the option of sorting the output by several 
attributes before displaying the records. You can also type Q to leave the 
display and return to the INVENTORY menu. At the end, you are asked if you 
want to see details of any particular record (an example of which is shown in 
Figure 3). If you do, enter the number of the desired record (not the scene ID 
number but the number on the far left of each row). You will also be given the 
opportunity to save that record in a disk file for later viewing or printing. 
This may be helpful for remembering the image file names or attributes. You 
can select as many records as you like to be saved. 


T) SCENE ID: YSI28I 18843X8 

PHYSICAL OESC : SO. CAL COAST 
HISTORY: HIA/VIS.SUISET 
QUALITY: NULL 

FILE POINTER: t . TH 1SANTA8844R3 . RAT 
FORHAT: DISITAL 


OATE: 12/14/84 
PLATFORM: LANOSAT 
MISSION: 5 
SENSOR: Th 
CLOUD COVER: #/ 

ON-LINE? ON 


CENTER LATITUOE : +04.599998 
CENTER LONGITUDE : -119.815555 
PLACE NAME: Sarvta Baroara,CA 


X-SRID: 42 
T-GRID: 05 


UCSB-BROMSE 


Mould Uou like to save this record in a disk file? C r /H 3 = n 

UsinS'the standard VAX OCL COPY coiaand 

At the To: , type 9our address/path/f i le in foraat 
node*userna«e Password*:: device; CpathJfi le .ext 

0i rector y soisn : cirohse .thj 

SANT AB844R3 . RAT; 1 1545 15-SEP-198? 19:23 

Total of 1 file/ 1545 blocks. 

Mould you like to transfer this Browse file to Uour terminal? CY/NJ: 

Figure 3. Example Results of a Query for Data 
of Santa Barbara, California 


After looking at a detailed record of an on-line browse file, you will be 
asked if you want to transfer it to your terminal. If you do, BROWSE invokes 
the appropriate file transfer protocol for your type of connection (KERMIT for 
dial-up users, COPY for DECNET or SPAN, and FTP for the TCP/IP networks). For 
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dial-up mode, BROWSE will automatically invoke KERMIT and pass the filename 
found in the INVENTORY record. For a user with V TERM on a PC compatible, you 
must press the ALT and ‘K’ keys simultaneously on your PC keyboard, when you 
see characters like ,Sp/@-#Yl~Y. At the VT- KERMIT > prompt, type receive. The 
transfer at 9600 baud may take 5 to 15 minutes, depending on the file size. 
When the transfer is complete, the PC will beep at you and give you another 
HERMIT prompt. Type connect to return to BROWSE. 

Alternatively, for SPAN/DECNET users, the DCL COPY command is invoked with 
the browse filename. VMS will prompt you with a To: at which point you should 
enter a complete path/filename where you want the file to be placed. Use the 
format: node"logname password"::device:[directory]filename . For TCP/IP users 
on ARPANET or NSFNET, you will be prompted for several items. At the FTP 
AWAITING HOST> prompt, type SET HOST computer name. At the FTP> prompt that 
results, type LOGIN username, and then your password at your home computer 
system when requested. Note that this transaction is occurring on the system 
level so your password is not being intercepted by the BROWSE system. Then at 
the FTP > prompt type SEND/TYPE = IMAGE filename, using the browse filename. When 
the FTP> prompt returns, type EXIT to continue. Please let us know if you have 
problems with these. Leave a comment in the BROWSE Messages utility or send 
mail to star@sbrsru.ucsb.edu or NASAMAIL JSTAR. 

After reviewing the records from the INVENTORY and transferring browse 
files to your terminal, select option 8 on the INVENTORY menu to return to the 
Main menu. 


4.2.2. Contents of the INVENTORY Database 

For the testbed, the INVENTORY contains about 600 records for digital 
imagery archived at the UCSB Remote Sensing Research Unit. A sample of these 
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scenes are in browseable form. Additional scenes of AVHRR sea surface 
temperature data for the North Atlantic were provided by Peter Comillon at the 
University of Rhode Island. 

For each scene, the database describes attributes for scene ID number, 
center coordinates, cloud cover, on-line status, processing history, image 
quality, and the full path /filename of the image file. The full database 
schema is available on request. 

4.2.3. Using the DIRECTORY Database 

The DIRECTORY database is similar in operation to the INVENTORY database. 
However, the DIRECTORY describes collections of data, emphasis of the 
collection, and the address of the facility. Query options in the DIRECTORY 
menu include platform/sensor combination, geographic area of emphasis, and data 
format. Conversely, a user can specify a specific node to see what is there. 
As in the INVENTORY, you can display query attributes and the results of the 
query. At this point, only the UCSB Remote Sensing Research Unit is on-line in 
our network. Therefore, accessing other facilities must be done externally to 
BROWSE. 

4.3. Displaying BROWSE Images 

When you have finished querying the databases and are ready to look at the 
images you have selected, choose option 5 on the main menu to leave BROWSE. 
You will automatically be logged out. However, on a PC using VTERM, you need 
to type control/break to return to DOS. Then run your local image display 
software. For IBM PC-AT users, we can provide a copy of IDS, a public domain 
image processing system. It has capabilities for displaying image data with 
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eight colors, plus a histogram, and statistics. All the commands are from pop- 
up menus, including options for contrast enhancement. Details of IDS 
operations are available on request. Of course, your can display the image on 
your own in-house image processing system, provided you can read files with an 
ERDAS header. We can provide assistance in changing file formats; contact us 
at the address on the cover. 
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Image Display System 


The BROWSE Image Display System (IDS) is an interactive software 
package that has been designed to be used for quick viewing of 
image data on a IBM PC type computer. The system includes a set 
of disk file I/O operations, as well as image display and 
enhancement routines which allow the user to view and assess the 
quality of image data to in order to better assess it's 
suitability for a specific application. IDS is configured in a 
menu driven format for easy use, and extensive error checking is 
included to aid the user in avoiding inadvertent errors. 
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System Description 


Hardware 

The hardware configurations supported by the system include 
most standard IBM PC devices, except that only "high-resolution” 
bit-mapped displays are supported, i.e. where the vertical 
resolution on the screen is at least 350 pixels. Input devices 
supported include several types of mice, the keyboard, and disk 
files. Output devices include most PC-compatible high-resolution 
bit-mapped CRT displays (e.g. IBM EGA, Tecmar Graphics Master, 
Hercules, Sigma 400) . The system requires a computer equipped 
with a harddisk and at least 512 Kbytes of RAM. 


Software 

The IDS package is an interactive image display system which 
can be used to view digital images from remote sensing sources. 
In this regard, the operations comprising the system can be 
grouped into the following four classes: user interface, file 
manipulation operations, image enhancement routines, and output 
procedures . 

i - User Interface 

The system is designed for use by individuals who may have 
little or no experience with micro computer systems. Therefore, 
the user has been provided an user interface which is both easy 
to use and largely (I) foolproof. The former was achieved by 
implementing a menu format for the user interface, the latter 
through extensive error checking of user inputs from the 
keyboard. 

The user interface consists of a bit-mapped display which is 
divided into several different static windows which contain the 
primary information displays for the system: the image itself, a 
histogram of the current image (or some subset) , a text box 
displaying elementary statistics for the current image (or some 
subset) , a title box, and two small windows which contain either 
co-ordinates or system status information. 

Two levels of temporary ("pop-up") menus are available to the 
user. The main menu is quite large and intended as a quick 
reference to the available commands and need not be invoked to 
obtain the secondary menus. The secondary menus must be opened in 
order to use the more specialised functions (e.g. enhancement, 
filtering) . 

For short queries and responses a small "dialog box" at the 
bottom of the display is used. For more complex queries. 
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responses, and status/progress reports, a large window is 
temporarily opened. In all cases user keyboard input is 
carefully checked and entry errors trapped. Graphical input 
(i.e. pointing functions) may be performed with either the 
keyboard cursor keys or a mouse. At virtually any time during 
matrix operations or data input the user may abort the current 
step, restoring the system to its previous state. 

ii - Files and Data Structures 

The system was designed to read and write image data to and 
from disk. Consequently, a critical component of IDIPS is a set 
of file input/ output operations. These operations allow the user 
to save a data file to disk after performing enhancement 
operations on that file, and to read a new data file into main 
memory from disk for display and/or manipulation. IDS is byte 
oriented and thus is designed to operate on images with pixel 
values in the range from 0 to 255. 

iii - Enhancement and Filtering Operations. 

The image processing routines consist of a small but 
comprehensive set of basic routines for the purpose of 
interactive image enhancement. The choice of routines was 
critical to the utility of the system, given the wide range of 
possible routines which could have been implemented. In 
addition, a set of statistical aids were included to aid the user 
in the assessment process. 

The IDS package is designed for the interactive assessment of 
digital images. The operations which have been implemented are 
subdivided into the following classes: 

- statistical aids 

- image enhancement 

- spatial filtering 

For reasons of modularity, the latter two sets of operations 
have been allocated individual sub-menus. The statistical aids 
are executed from the main menu. The following chapters outline 
the various features which have been implemented and describe the 
algorithms employed (where appropriate) . 
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Digital images contain statistical information which is not 
visually obvious, but which may be highly useful in the 
assessment of image quality. The following aids have been 
incorporated in the IDS package. 

- image histogram 

- image parameters: minimum, maximum, mean, 

mode, median, standard 
deviation, number of pixels 

- pixel extraction 

The image histogram and image parameters are included as part 
of the main IDS display and are continuously present on the 
screen. Various keystroke combinations allow the user to update 
these displays for the entire image, or for any training 
rectangle which may be interactively specified. Pixel extraction 
enables the user to determine pixel frequencies, percentages, and 
class bounds for any user-specified rectangle. 
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Image enhancement involves the pixel by pixel transformation 
of digital images in order to improve visual discrimination of 
image components. This is achieved by transforming or 
'stretching' the image data according to a specific mathematical 
function which is applied over the entire range of pixel values 
within an image, or to a subset thereof. The pixel by pixel 
nature of the transformation involved permits the image 
enhancement procedure to be implemented using an efficient table 
lookup algorithm. 

Three image enhancement routines have been implemented in IDS; 
linear enhancement, piecewise enhancement, and automated 
enhancement. In each case a linear transformation function of 
the form below is employed. Out-of-range values are set to the 
end-of-range value. The general form of the transformation is: 

i-low where i = pixel value 

T(i) = * 255; low = low range delimiter 

high-low high = high range delimiter 

Linear contrast enhancement will enhance a single user specified 
range of pixel values over the range from 0 to 255. Piecewise 
contrast enhancement is designed to operate on images with 
asymmetric pixel distributions. The stretching operation is 
applied to two distinct ranges of pixel values thereby reducing 
histogram asymmetry and improving image contrast. 

Automated contrast enhancement will saturate a user specified 
percentage of pixels high (255) and likewise low (0). The user 
is prompted for the percentage of pixels to saturate in each 
case, and the cumulative frequency distribution of the given 
image is used by the routine in order to determine the actual 
range of pixel values to stretch. 
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Filtering techniques can be employed for purposes such as 
noise suppression, edge enhancement, or directional enhancement 
of image features. These techniques, like contrast enhancement, 
perform a pixel by pixel transformation on images. However, 
spatial filtering, unlike contrast manipulation, is context 
dependent in that the transformation applied to any pixel is 
dependent on the values of its neighbouring pixels. As a result, 
filtering techniques are more computationally intensive than are 
image enhancement techniques. To partially offset this 
computational burden an efficient box filter algorithm has been 
implemented for those filters where possible. 

All of the filters included in IDS employ a 3 x 3 convolution 
matrix. The following filters have been implemented: 

smoothing - each pixel is set to mean of 8 neighbors 

median - each pixel is set to median of 8 neighbors 

mode - each pixel is set to mode of 9 pixels 

edge enhancement - each pixel is set to three times its own 

value minus the sum of non-diagonal 
neighbour pixels 

user defined - allows the user to specify his or her own 

convolution matrix. 

The first three filters allow the user to smooth or remove random 
noise from an image. Edge enhancement acts to emphasize large 
gradients between adjacent pixels values thereby enhancing edges 
within an image. User defined filtering enables the user to 
apply any 3x3 convolution matrix which may be appropriate for 
specific applications. 
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Introduction 

The following commands are available in the IDIPS system. 
They are presented in the order in which they appear in the 
various menus, i.e. in logical groupings rather alphabetically. 

1 - Matrix Operations 

Alt-L : Load Matrix 

This loads a valid DOS file into one of the available 
bands. The system will request a valid file-name , and look 
for a header with image parameters appended to the file. If 
a header format is encountered with which the system is 
familiar, the the system will use this information to 
automatically load the data (if possible) . Otherwise the 
user is promtpted to manually enter the appropriate 
information (ie. number of lines and samples) . 

Alt-S : save Matrix 

This saves the current band-matrix into a file whose name 
is specified by the user. The user is prompted for a valid 
file-name. 

Alt-H : Help 

This opens a small window on the screen and presents the 
user with a branching menu of items upon which help is 
available. In fact, the help information is simply an edited 
version of this document, with branches to locations 
appropriate (??) to the user's needs. 

Alt-Q S Quit 

This allows exit of the system without saving either the 
parameter-record or the currently loaded image data. 


2 - Rectangle Specification 

Alt-T - Training Rectangles 

A number of system operations (e.g. pixel extraction) 
require that the user select rectangular areas of the image 
to delimit either the area to be sampled, operated upon, or 
both. Pressing Alt-T initiates a procedure in which the user 
may interactively create, save, edit, and/or delete up to 8 
different rectangles. 

The user is prompted to select the training rectangle to 
be "edited'' . If it does not yet exist the user is prompted to 
specify, using a pointing device (keyboard or mouse) , two 
opposite corners of the rectangle. The user may also change 
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a previously allocated rectangle by specifying its number, or 
delete any one by pressing <D> and giving a valid number at 
the prompt. One may also delete all of them by pressing <X> 
and confirming the request. One may also save them all to 
disk by pressing <S> and specifying a valid file-name 
(extension not required or necessary - '.RCT' automatically 
added) . 

Alt-J : Display Window 

This allows the user to interactively specify the desired 
display rectangle within the current matrix dimensions. The 
user is prompted to use the pointer (mouse or keyboard) to 
select the opposite corners of the new display rectangle. 
When two corners have been selected, the system will 
automatically regenerate the main image window with the new 
display co-ordinates. 

Note that if the specified rectangle is larger than the 
image itself (i.e. in the area where the axes and labels are 
drawn) the system will interpret the "oversize” specification 
as a request to increase the current rectangle size. This 
provides an easy method for expanding the display back to 
full dimensions after it has been "zoomed in" on some 
particular feature. To restore the display to full 
dimensions, simply specify the corners as the farthest 
lower-right corner and farthest upper-left corner. 


3 - Ppdate/Output Requests 
Alt-6 : Update All 

This causes the system to re-calculate the statistics for 
the current band-matrix, update the main image, the 
elementary statistics window, the histogram and ogive. 

Alt- I : Update Image 

This causes the system to update the main image display, 
but does not cause it to update any of the statistics, either 
the calculations or the displays. 

Alt-1.. 9 : Update Training Rectangle Statistics 

This causes the system to calculate the current statistics 
for the specified training rectangle, n, (where <Alt-n> was 
pressed) and display them in the elementary statistics window 
and as a histogram and ogive. 

Alt-0 : Update Matrix Statistics 

This causes the system to re-calculate the statistics for 
the current data file, using the current operations 
rectangle . 

Alt-s : update Display Statistics 

This causes the system to calculate and display the 
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statistics of the current data file based on the current 
display rectangle. 

Enhancement Menu 


1 - Linear Stretch 

This procedure uses user-specified lower and upper bounds 
to perform a linear stretch on the current image data over 
the indicated data range. The user is prompted for the two 
values and the enhancement is then performed. Values below 
the lower boundary are set equal to the lower boundary while 
value above the upper limit are set equal to the upper 
boundary . 

2 - Piecewise Stretch 

This procedure allows the user to apply differential 
linear stretching to different parts of the available 
bandwidth. The user is prompted for the number of ranges to 
be stretched, and is then prompted to enter the lower bound 
of each class. 

3 - Automated Stretch 

This procedure allows the user to specify the percentage 
of the observed frequency distribution to be saturated either 
low or high. The user is prompted for the percentage to be 
saturated low, and then the percentage to be saturated high. 
The routine uses the observed frequency distribution to find 
the data values corresponding to the specified degree of 
saturation, and then performs a linear stretch over that 
range . 
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5 - Filter Menu 


1 - Smoothing (Moving Average) 

This procedure uses a moving average (sum of all pixels 
in 3x3 cell divided by 9 placed in center cell) to smooth the 
pixel by pixel variation in the current image data. The user 
is prompted for the destination (band) of the result. 

2 - Median 

This procedure uses the median value of each 3x3 cell to 
replace the central value of each cell. The user is prompted 
for the destination (band) of the result 

3 - Edge Detect 

This filter enhances the spatial variation in pixel 
values, thus enhancing areas of rapid changes, or "edges". 
The filter operates by replacing the central value of the 
cell by the 8 times the central value less the sum of the 
other eight members of the cell. The user is prompted for 
the destination (band) of the result. 

4 - Laplace 

This procedure performs a convolution of the current 
image data using a 3x3 Laplacian filter, i.e. where the 
central value is replaced by 9 times the central value less 
the sum of the other eight values. The user is prompted for 
the destination of the result. 

5 - User-Specified 

This procedure allows the user to specify any values he 
wishes to be used as a mask/filter in a convolution over the 
current image data. The user is first prompted for the 9 
values which make up the 3x3 cell, then the destination 
(band) of the result. 

6 - Pixel Extraction Menu 

1 - Point Sampling 

This option allows the user to "roam" the present image 
with the cursor in order to examine, in detail, the 
distribution of values in the image. The user may exit the 
procedure by pressing <ESC>. 

2 - Profile Sampling 

This option the user to extract linear profiles along a 
specified transect. The user is prompted to move the cursor 
to the first endpoint of the profile and then press <CR>. As 
the cursor is moved to the second endpoint the system will 
sketch ("rubber band") the shape of the profile as the user 
moves the cursor. When the second endpoint is selected the 
system will redraw the entire final profile, draw a trace on 
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the image itself indicating the position of the transect, and 
then exit. 

3 - Area Sampling 

This option allows the user to select a rectangular area 
(either a previously allocated training rectangle or a user- 
specified rectangle) and extract summary statistics about the 
distribution of pixel classes. The classes used are those 
currently specified for the image, hence a maximum of eight. 
The user may, however, specify a different number of classes 
if he wishes. The user is initially prompted to specify a 
rectangle to be sampled, either the entire matrix, one of the 
training rectangles, or an interactively specified rectangle 
(option "W") . The resulting class histogram is plotted in 
the histogram window and the class statistics (text) are 
displayed in a window in the upper right. The area specified 
in the last column of the summary statistics are expressed in 
terms of "square pixels". 
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APPENDICES 


Appendix A - System Installation 


System Files 

The IDS program consists of 4 main files: 

IDIPS1.COM 
IDIPS1.001 
IDIPS1 . 002 
IDIPS1 . 002 

In addition, the system must have access to the three font 
files: 

ROMANS IM.FNT 
SYSTEM08 . FNT 
MARKERS . FNT 

Also, the system must have access to the graphics driver 
software: 


METAWNDO.EXE 

All of these files must be in the directory from which IDIPS 
is started. 


Setting Up the System 

It is highly recommended that you load all of the IDS files 
and the font files into their own subdirectory of your hard 
disk and start the program from there. Two batch file shave 
been provided, HDINSTAL.BAT and IDS. BAT. The former dimply 
creates an \IDS subdirectory and loads all the files into it, 
and puts the IDS. BAT file in the root directory. The IDS. BAT 
file loads the MetaWindow driver and invokes IDS. 



